(19) 



J 



Europaisches Patentamt 
European Patent Office 
Office europeen des brevets 



(12) 



(11) EP 0 784 093 A1 

EUROPEAN PATENT APPLICATION 



(43) Date of publication: 

16.07.1997 Bulletin 1997/29 

(21) Application number: 96309363.8 

(22) Date of filing: 20.12.1996 



(51) mtci* C12N 15/12, C07K 14/715, 
C12N 5/10, A01K 67/027, 
C07K 19/00, C12N 15/62, 
C07K 16/28, C07K 1/107, 
C12Q 1/68, G01N 33/50, 
G01 N. 33/566, A61 K 38/1 7, 
A61 K 48/00, C12N 1/21 
// (C12N1/21, C12R1:19) 



(84) 


Designated Contracting States: 


• Calzone, Frank J. 




AT BE CH DE DK ES Fl FR GB GR IE IT LI LU MC 


Westlake Village, California 91361 (US) 




NL PT SE 


• Lacey, David L. 




Designated Extension States: 


Thousand Oaks, California 91320 (US) 




AL LT LV SI 


• Chang, Ming-Shi 






Newbury Park, California 91320 (US) 


(30) 


Priority: 22.12.1995 US 577788 






03.09.1996 US 706945 


(74) Representative: Brdwn, John David 






FORRESTER & BOEHMERT 


(71) 


Applicant: AMGEN INC. 


Franz-Joseph-Strasse 38 




Thousand Oaks, CA 91320-1789 (US) 


80801 Munchen (DE) 


(72) 


Inventors: 


Remarks: 


• 


Boyle, William J. 


The applicant has subsequently filed a sequence 




Moorpark, California 93021 (US) 


listing and declared, that it includes no new matter. 



(54) Osteoprotegerin 

(57) The present invention discloses a secreted 
polypeptide, termed osteoprotegerin, which is a mem- 
ber of the tumor necrosis factor receptor superfamily 
and is involved in the regulation of bone metabolism. 
Also disclosed are nucleic acids encoding osteoprote- 



gerin, polypeptides, recombinant vectors and host cells 
for expression, antibodies which bind OPG, and phar- 
maceutical compositions. The polypeptides are used to 
treat bone diseases characterized by increased resorp- 
tion such as osteoporosis. 



CO 

o 

00 

BEST AVAILABLE 

Printed by Jouve. 75001 PARIS (FR) 



Copy 



BNSDOCID: <EP 0764093A1_I_> 



1 



EP 0 784 093 A1 



Description 

Field of the Invention 

s The invention relates generally to polypeptides involved in the regulation of bone metabolism. More Particularly 

the invJntionSes to a novel polypeptide, termed osteoprotegerin, which is a member of the tumor necrosis actor 
Z^xTZertLy. The polypeptide is used to treat bone diseases characterized by increased bone loss such as 
osteoporosis. 

io Background ot the Invention 

Polypeptide growthfactors and cytokines are secreted factors which signal a wide variety of ^"gesmcellgro^h, 
differentiation and metabolism, by specifically binding to discrete, surface bound receptors. As a class of prote.ns 
fS^rS^thrSructure- and mode of signal transduction. They are characterized by hav.ng an extracellular 
SS^telSloZInn ligand binding, and cytoplasmic domain which transmits an appropriate mtrace lular s,gnaL 
^S^^t>aJ» ultimate* determine which cells will respond to a given ligand, while the structure a a 
J^£?S£ the cellular response induced by ligand binding. Receptors have been f^oX = ,^ 
Lular signals via their cytoplasmic domainsby actuating protein tyrosine, or Pf^~"~^™?SK5^ 
re a Platelet derived growth factor receptor (PDGFR) or transforming growth factor-p receptor-l (TGFpR ), by stim 
u!a?in£^ 

ducina proteins (e q TNFR-1 and Fas/APO) (Heldin, Cell 80, 213-223 (1995)). 

dUC ;L P ;umor S n ( ec^is factor receptor (TNFR) superfamily is a group of type I J^^'* j£ 

a conserved cysteine-rich motif which is repeated three to six times .n the f race,,u ^^ a ^.^ t^n e f' 
953-962 (1994)) Collectively, these repeat units form the ligand b.nd.ng domains of these receptors (Chen et al 
Chemtftry 270 2874^78 (1 995)). The ligand. for these receptors are a structurally related group of proteins homo.- - 
ogoustoT^ (Gotdde. el a, Cold Spring HarborSymp. Qua,. Biol. 51, 597*09 (,986) Nagata at al 
1449-1456 (1995)) TNFcc binds to distinct, but closely related receptors, TNFR-1 and TNFR-2. TNFa produces a 
variety of biological responses in receptor bearing cells, including, pro.iferation, different.at.on, and cytotox.c.ty and 
annntn^i^ fBeutler et al Ann. Rev. Biochem. 57 , 505-518 (1988)). 

TTN^a is beloved to mediate acute and Tronic inflammatory responses (Beutler et al. Ann. Rev. ^e"j. g, 
505-508 (1988)) Systemic delivery of TNFa induces toxic shock and widespread tissue necros.s. Because of the. 
TNF« miy be ^responsible for the severe morbidity and mortally associated with a variety of '^ct.ous d.seases in- 
cluding sepsis Mutations in FasL, the ligand for the TNFR-related receptor Fas/APO (Suda et al Cell2§. ^ 6 ^ 7Q 
M99sS is associated with autoimmunity (Fisher et al. Cell 81, 935-946 (1995)), while overproduction of FasL may be 
•iSt'Jd ?n drua induced hepatitis Thus ligands to the various TNFR-related proteins often mediate the serious 
S^inSaT^^^ that agents that neutralize the activity of these liga nds wou« haye 
therSeutb Se Soluble TNFR-1 receptors, and antibodies that bind TNFa, have been tested for their ab.l ty to 
neuTra^ 

TM^R.^ m^^ a was recently cloned, and its product tested for its ability to neutralize TNFa activity jn vitro and iD ^2 
mohno eTal PUaI UsIs*. 8331 -8335 (1 990)): The ability of this protein to neutralize TNFa suggests hat soluble 
TN "ecepSr Unction to^nd and clear TNF thereby blocking the cytotoxic effects on TNFR- bearing cells. 

A^ object of the^nvent on to identify new members of the TNFR super family. It is anticipated that new family 
members may be transmembrane proteins or so.ubleformsthereof comprising extracellular doma.ns and lacking trans- 
Temorne aL cosmic domains. We have identified a new member <^^^7^S^ 
secreted protein that is closely related to TNFR-2. By analogy to soluble TNFR-1, the TNFR-2 related protein may 
negSvely regale the activity of its ligand, and thus may be useful in the treatment of certa.n human diseases. 

Summary of the Invention 

A novel member of the tumor necrosis factor receptor (TNFR) superfamily has been identified from a fetal rat 
intestinal cDNA library A full-length cDNA clone was obtained and sequenced. Expression of the rat cDNA in a trans- 
g^nTmous ^rev afed a marked increase in bones density, particular* in long bones, pelvic ^^^ b ~J£ 
polypeptide encoded by the cDNA is termed Osteprotegerin (OPG) and plays a role ,n promot.ng bone accumu 1atK»v 
P T^e invention p Jdes for nucleic acids encoding a polypeptide having at least one of the b.o.ogical activities of 
OPG Nucleic acids which hybridize to nucleic acids encoding mouse, rat or human OPG as shown ,n Figures 2B-2C 
SeQ ID NO 1^9^ (SEQ ID NO:1 22), and 9C-9D (SEQ ID NO:124) are also provided. Preferably, OPG ,s mam- 
ma£n OPG and more preferably is human OPG. Recombinant vectors and host cells express.ng OPG are ^ en- 
compassed as are methods of producing recombinant OPG. Antibodies or fragments thereof wh,ch specially b.nd 
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the polypeptide are also disclosed. 

Methods of treating bone diseases are also provided by the invention. The polypeptides are useful for preventing 
bone resorption and may be used to treat any condition resulting in bone loss such as osteoporosis, hypercalcemia, 
Paget's disease of bone, and bone loss due to rheumatoid arthritis or osteomyelitis, and the like. Bone diseases may 
5 also be treated with anti-sense or gene therapy using nucleic acids of the invention. Pharmaceutical compositions 
comprising OPG nucleic acids and polypeptides are also encompassed. 

Description of the Figures 

10 Figure 1. A. FASTA analysis of novel EST LORF. Shown is the deduced FRI-1 amino acid sequence aligned to 

the human TNFR-2 sequence. B. Profile analysis of the novel EST LORF shown is the deduced FRI-1 amino acid 
sequence aligned to the TNFR-profile. C. Structural view of TNFR superfamity indicating region which is homologous 
to the novel FRI-1. 

Figure 2. Structure and sequence of full length rat OPG gene, a novel member of the TNFR superfamily. A. Map 

is of pMOB-B1.1 insert. Box indicates position of LORF within the cDNA sequence (bold line). Black box indicates signal 
peptide, and gray ellipses indicate position of cysteine-rich repeat sequences. B, C. Nucleic acid and protein sequence 
• ■of the Rat OPG cDNA. The predicted signal peptide is underlined, and potential sites of N-linked glycosylation are 
.' indicated in bold, underlined letters. D, E. Pileup sequence comparison (Wisconsin GCG Package, Version 8.1) of 
OPG with other members of the TNFR superfamily. 

20 T Fas (SEQ ID NO 128); tnfrl (SEQ ID NO: 129); sfu-t2 (SEQ ID: 130); tnfr2 (SEQ ID NO: 131); id40 (SEQ ID NO: 
132); osteo (SEQ ID NO:133); ngfr (SEQ ID NO:134); ox40 (SEQ ID NO:135); 41bb (SEQ NO ID NO:136). 

Figure 3. PepPlot analysis (Wisconsin GCG Package, Version 8.1) of the predicted rat OPG protein sequence. A. 
Schematic representation of rat OPG showing hydrophobic (up) and hydrophiiic (down) amino acids. Also shown are 
basic (up) and acidic (down) amino acids. B. Display of amino acid residues that are beta-sheet forming (up) and beta- 

25 sheet breaking down) as defined by Chou and Fasman (Adv. Enz. 47, 45-1 47 (1 948)). C: Display of propensity meas- * 
ures for alpha-helix and beta-sheet (Chou and Fasman, ibid ). Curves above 1 .00 show propensity for alpha-helix or 
beta-sheet structure. Structure may terminate in regions of protein where curves drop below 1.00. D. Display of residues 
that are alpha-forming (up) or alpha-breaking (down). E. Display of portions of the protein sequence that resemble 
sequences typically found at the amino end of alpha and beta structures (Chou and Fasman, ibid). R Display of portions 

30 of the protein sequence that resemble sequences typically found at the carboxyl end of alpha and beta structures 
(Chou and Fasman, jbid). G. Display of portions of the proteins sequence typically found in turns (Chou and Fasman, 
ibid ) H. Display of the helical hydrophobic moment (Eisenberg et al. Proc. Natl. Acad. Sch USA ,81 , 140-144 (1984)) 
at each position in the sequence. I. Display of average hydrophathy based upon Kyte and Doolittle (J. Mol. Biol. 157, 
105-132 (1982)) and Goldman et al. (reviewed in Ann. Rev. Biophys. Biophys. Chem. 15, 321-353 (1986)). 

35 Figure 4. mRNA expression patterns for the OPG cDNA in human tissues. Northern blots were probed with a 32P- 

labeled rat cDNA insert (A, left two panels), or with the human cDNA insert (B, right panel). 

. Figure 5. Creation of transgenic mice expressing the OPG cDNA in hepatocytes. Northern blot expression of HE- 
OPG transgene in mouse liver. 

Figure 6. Increase in bone density in OPG transgenic mice. Panel A-F. Control Mice. G-J, OPG expressing mice. 

40 At necropsy, all animals were radiographed and photographs prepared. In A-F, the radiographs of the control animals 
and the one transgenic non-expressor (#28) are shown. Note that the bones have a clearly defined cortex and a lucent 
central marrow cavity. In contrast, the OPG (G-J) animals have a poorly defined cortex and increased density in the 
marrow zone. 

Figure 7. Increase in trabecular bone in OPG transgenic mice. A-D. Representative photomicrographs of bones 
45 from control animals. In A and B, low (4X, 10X) power images of the femurs are shown (Masson Trichrome stain). 
Stains for tartrate resistant acid phosphatase (TRAP) demonstrate osteoclasts (see arrows) both resorbing cartilage 
(C) and trabecular bone (D). Note the flattened appearance of osteoclasts on trabecular bone. E-H. Representative 
photomicrographs of bones from OPG-expressing animals. In E and F, low (4X, 10X) power images of the femurs are 
shown (Masson Trichrome stain). The clear region is the growth plate cartilage, blue stained area is bone, and the red 
so area is marrow. Note that in contrast to the controls, the trabecular bone has not been resorbed resulting in the absence 
of the usual marrow cavity. Also, the resulting trabeculae have a variegated appearance with blue and clear areas. 
The clear areas are remnants of growth plate cartilage that have never been remodelled. Based on TRAP stains, these 
animals do have osteoclasts (see arrows) at the growth plate (G), which may be reduced in number. However, the 
surfaces of the trabeculae away from the growth plate are virtually devoid of osteoclasts (H), a finding that stands in 
55 direct contrast with the control animals (see D). 

Figure 8. HE-OPG expressors do not have a defect in monocyte-macrophage development. One cause for oste- 
opetrosis in mice is defective M-CSF production due to a point mutation in the M-CSF gene. This results in a marked 
deficit of circulating and tissue based macrophages. The peripheral blood of OPG expressors contained monocytes 
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as assessed by H1E analysis. To affirm the presence of tissue macrophages, immnohistochemistry was performed 
using F480 antibodies, which recognize a cell surface antigen on murine macrophages. A and C show low power (4X) 
photomicrographs of the spleens from normal and CR1 overexpressors. Note that both animals have numerous F480 
positive cells. Monocyte-macrophages were also present in the marrow of normal (B) and HE-OPG overexpressors 

(D) Figure 9 Structure and sequence of mouse and human OPG cDNA clones. A, B. Mouse cDNA and protein se- 
quence C D Human cDNA and protein sequence. The predicted signal peptides are underlined, and potential sites 
of N-linked glycosylate* are indicated in bold. E, F. Sequence alignment and comparison of rat, mouse and human 
OPG amino acid sequences. 

io Figure 10 Comparison of conserved sequences in extracellular domain of TNFR-1 and human OP(a. 

PrettyPlot (Wisconsin GCG Package, Version 8.1 ) of the TNFR1 and OPG alignment described in example 6. Top 
line, human TNFR1 sequences encoding domains 1-4. Bottom line, human OPG sequences encoding domains 1-4. 
Conserved residues are highlighted by rectangular boxes. 

Figure 1 1 Three-dimensional representation of human OPG. Side-view of the Molescript display of the predicted 

is 3-dimensional structure of human OPG residues 25 through 163, (wide line), co^rystallized with human TNFp (thin 
line) As a reference fororientation, the bold arrows along the OPG polypeptide backbone are pointing in the N-terminal 
to C-terminal direction The location of individual cysteine residue side chains are inserted along the polypeptide back- 
bone to help demonstrate the separate cysteine-rich domains. The TNFp molecule is aligned as described by Banner 
Gt al (1 993) 

20 Figure 12 Structure ot OPG cysteine-rich domains. Alignment of the human (top line SEQ ID NO:136) and mouse 

(bottom line) OPG amino acid sequences highlighting the predicted domain structure of OPG . The polypeptide is divided 
into two halves; the N-terminus (A), and C-terminus (B). The N-terminal half is predicted to contain four cysteine rich 
domains (labeled 1-4). The predicted intrachain disulfide bonds are indicated by bold lines, labeled "SSV, "SS2 n , or 
B SS3° Tyrosine 28 and histidine 75 (underlined) are predicted to form an ionic interaction. Those amino acids predicted 
25 to interact with an OPG ligand are indicated by bold dots above the appropriate residue. The cysteine residues located • 
in the C-terminal half of OPG are indicated by rectangular boxes. 

Figure 1 3 Expression and secretion of full length and truncated mouse OPG-Fc fusion proteins. A. Map indicating 
points of fusion to the human lgG1 Fc domain are indicated by arrowheads. B. Silver stain of and SDS-polyacry lam.de 
gel of conditioned media obtained from Fl.Fc (Full length OPG fused to Fc at Leucine 401 ) and CT.Fc (Carboxy-terminal 
30 truncated OPG fused to Fc at threonine 180) fusion protein expression vectors. Lane 1, parent pCEP4 expression 
vector cell line- Lane 2, Fl.Fc vector cell line; Lane 3, CT.Fc vector cell line. C. Western blot of conditioned media 
obtained from Fl Fc and CT.Fc fusion protein expression vectors probed with anti-human lgG1 Fc domain (Pierce). 
Lane 1 parent pCEP4 expression vector cell line; Lane 2, Fl.Fc vector cell line; Lane 3, CT.Fc vector cell line. 

Figure 14 Expression of human OPG in E. coli. A. Construction of a bacterial expression vector. The LORF of the 
35 human OPG gene was amplified by PCR, then joined to a oligonucleotide linker fragment (top strand is SEQ ID NO: 
137- bottom strand is SEQ ID NO:127), and ligated into pAMG21 vector DNA. The resulting vector is capable of ex- 
pressing OPG residues 32-401 linked to a N-terminal methionine residue. B SDS-PAGE analysis of uninduced and 
induced bacterial harboring the pAMG21 -human OPG - 32-401 plasmid. Lane 1, MW standards; lane 2, uninduced 
bacteria- lane 3 30°C induction; lane 4, 37°C induction; lane 5, whole cell lysate from 37° C induction; lane 6, soluble 
40 fraction of whole cell lysate; lane 7, insoluble fraction of whole cell lysate; lane 8, purified inclusion bodies obtained 
from whole cell lysate. , t , . _ . 

Figure 15 Analysis of recombinant murine OPG produced in CHO cells by SDS-PAGE and western blotting. An 
equal amount of CHO conditioned media was applied to each lane shown, and was prepared by treatment with either 
reducing sample buffer (left lane), or non-reducing sample buffer (right lane). After electrophoresis, the resolved pro- 
45 teins were transferred to a nylon membrane, then probed with anti-OPG antibodies. The relative positions ot the 55 
kd monomeric and 100 kd dimeric forms of OPG are indicated by arrowheads. 

Figure 1 6 Pulse-chase analysis of recombinant murine OPG produced in CHO cells. CHO cells were pulse-labeled 
with ^s-methionine/cysteine, then chased for the indicated time. Metabolically labeled cultures were separated into 
both conditioned media and cells, and detergent extracts were prepared from each, clarified, then immunoprecipitated 
50 with anti-OPG antibodies. The immunoprecipitates were the resolved by SDS-PAGE, and exposed to film. Top left and 
right panels- samples analyzed under non-reducing conditions. Lower left and right panels; samples analyzed under 
reducing conditions. Top and bottom left panels; Cell extracts. Top and bottom right panels; Conditioned media extracts. 
The relative mobility of the 55 kd monomeric and 100 kd dimeric forms of OPG are indicated by arrowheads. 

Figure 17. Expression of OPG in the CTLL-2 cell line. Serum-free conditioned media from CTLL-2 cells and CHO- 
ss mu OPG [1 -401 ] transfected cells was prepared, concentrated, then analyzed by non-reducing SDS-PAGE and western 
blotting. Left lane; CTLL-2 conditioned media. Right lane; CHO-muOPG conditioned media. The relative mobility of 
the 55 kd monomeric and 1 00 kd dimeric forms of OPG are indicated by arrowheads. 

Figure 18. Detection of OPG expression in serum samples and liver extracts obtained from control and OPG 
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transgenic mice. Transgenic mice were constructed as described in Example 4. OPG expression was visualized after 
SDS-PAGE followed by Western blotting using anti-OPG antibodies. 

Figure 19. Effects of huOPG [22-401 ]-Fc fusion protein on osteoclast formation jn vitro . The osteoclast forming 
assay was performed as described in Example 11 A in the absence (control) or presence of the indicated amounts of 
s huOPG [22-401 ]-Fc fusion. Osteoclast formation was visualized by histochemical staining for tartrate acid phosphatase 
(TRAP). ). A. OPG added to 100 ng/ml. D. OPG added to 0.1 ng/ml. E. OPG added to 0.01 ng/ml. F. OPG added to 
0.001 ng/ml. G. Control. No OPG added. 

Figure 20. Decrease in osteoclast culture TRAP activity with increasing amounts of OPG. Indicated concentrations 
of huOPG [22-401 ]-Fc fusion protein were added to osteoclast forming assay and TRAP activity quantitated as de- 
10 scribed in Example 11 A. 

Figure 21. Effect ol OPG on a terminal stage of osteoclast differentiation. huOPG [22-40 1]-Fc fusion was added 
to the osteoclast forming assay during the intermediate stage of osteoclast maturation (days 5-6; OPG-CTL) or during 
the terminal stage of osteoclast maturation (days 7-15; CTL-OPG). TRAP activity was quantitated and compared with 
the activity observed in the absence of OPG (CTL-CTL) in the presence of OPG throughout (OPG-OPG). 
15 Figure 22. Effects of IL-1 p, IL-1 a and OPG on blood ionized calcium in mice. Levels of blood ionized calcium were 

monitored after injection of IL-1p alone, IL-1aalone, IL-1 p plus muOPG [22-401]-Fc, IL-1aplus MuOPG [22-401]-Fc, 
and muOPG [22-40l]-Fc alone. Control mice received injections of phosphate buffered saline (PBS) only. IL-1 B ex- 
periment shown in A; IL-1 a experiment shown in B. 

Figure 23. Effects of OPG on calvarial osteoclasts in control and I L1 -treated mice. Histological methods for ana- 
20 lyzing mice calvarial bone samples are described in Example 1 1 B. Arrows indicate osteoclasts present in day 2-treated 
mice. Calvarial samples of mice receiving four PBS injections daily (A), one injection of IL-1 and three injections of 
PBS daily (B), one injection of PBS and three injections of OPG daily (C), one injection of IL-1 and three injections of 
OPG daily. 

Figure 24. Radiographic analysis of bone accumulation in marrow cavity of normal mice. Mice were injected sub- 
25 cutaneously with saline (A) or muOPG [22-401 ]-Fc fusion (5mg/kg/d) for 14 days (B) and bone density determined as * 
described in Example 11 C. 

Figure 25. Histomorphometric analysis of bone accumulation in marrow cavity of normal mice. Injection experi- 
ments and bone histology performed as described in Example 11C. 

Figure 26. Histology analysis of bone accumulation in marrow cavity of normal mice. Injection experiments and 
30 bone histology performed as described in Example 11 C. A. Saline injection B. Injection of muOPG [22-401 ]-Fc fusion. 

Figure 27. Activity of OPG administered to ovariectomized rats. In this two week experiment the trend to reduced 
bone density appears to be blocked by OPG or other anti-resorptive therapies. DEXA measurements were taken at 
time of ovariectomy and at week 1 and week 2 of treatment. The results are expressed as % change from the initial 
bone density (Mean +/- SEM). 

35 Figure 28. Bone density in the femoral metaphysis, measured by histomorphometric methods, tends to be lower 

in ovariectomized rats (OVX) than sham operated animals (SHAM) 17 days following ovariectomy. This effect was 
blocked by OPG-Fc, with OPG-Fc treated ovariectomized rats (OVX+OPG) having significantly higher bone density 
than vehicle treated ovariectomized rats (OVX). (Mean +/- SEM). 

40 Detailed Description of the Invention 

A novel member of the tumor necrosis factor receptor (TNFR) superfamily was identified as an expressed sequence 
tag (EST) isolated from a fetal rat intestinal cDNA library . The structures of the full-length rat cDNA clones and the 
corresponding mouse and human cDNA clones were determined as described in Examples 1 and 6. The rat, mouse 

45 and human genes are shown in Figures 2B-2C (SEQ ID NO:120), 9A-9B (SEQ ID NO:122), and 9C-9D (SEQ ID NO: 
1 24), respectively. All three sequences showed strong similarity to the extracellular domains of TNFR family members. 
None of the full-length cDN A clones isolated encoded transmembrane and cytoplasmic domains that would be expected 
for membrane-bound receptors, suggesting that these cDN As encode soluble, secreted proteins ratherthan cell surface 
receptors. A portion of the human gene spanning nucleotides 1200-1353 shown in Figure 9D was deposited in the 

50 Genebank database on November 22, 1995 under accession no. 17188769. 

The tissue distribution of the rat and human mRNA was determined as described in Example 2. In rat, mRNA 
expression was detected in kidney, liver, placenta and heart with the highest expression in the kidney. Expression in 
skeletal muscle and pancreas was also detected. In humans, expression was detected in the same tissues along with 
lymph node, thymus, spleen and appendix. 

55 The rat cDNA was expressed in transgenic mice (Example 3) using the liver-specific ApoE promoter expression 

system. Analysis of expressors showed a marked increase in bone density, particularly in long bones (femurs), verte- 
brae and flat bones (pelvis). Histological analysis of stained sections of bone showed severe osteopetrosis (see Ex- 
ample 4) indicating a marked imbalance between bone formation and resorption which has led to a marked accumu- 
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lationof bone and cartilage. A decrease inthenumber of trabecular osteoclasts in the bones of OPG expressor animals 
indicate that a significant portion of the activity of the TNFR-related protein may be to prevent bone resorption, a process 
mediated by osteoclasts. In view of the activity in transgenic expressors, the TNFR-related proteins described herein 
are termed OPGs. 

5 Using the rat cDNA sequence, mouse and human cDNA clones were isolated (Example 5). Expression of mouse 

OPG in 293 cells and human OPG in E. coli is described in Examples 7 and 8. Mouse OPG was produced as an Fc 
fusion which was purified by Protein A affinity chromatography. Also described in Example 7 is the expression of full- 
length and truncated human and mouse OPG polypeptides in CHO and 293 cells either as fusion polypeptides to the 
Fc region of human lgG1 or as unfused polypeptides. The expression of full-length and truncated human and mouse 

io OPGs in E. coli either as Fc fusion polypeptides or as unfused polypeptides is described in Example 8. Purification of 
recombinant^ produced mammalian and bacterial OPG is described in Example 10. 

The biological activity of OPG was determined using an in vitro osteoclast maturation assay, an in vivo model of 
interleukin-1 (IL-1) induced hypercalcemia, and injection studies of bone density in normal mice (see Example 11). 
The following OPG recombinant proteins produced in CHO or 293 cells demonstrated activity in the in E. coli osteoclast 

is maturation assay: muOPG [22-185]-Fc, muOPG [22-1 94]-Fc, muOPG [22-401 ]Fc, muOPG [22-401], huOPG [22-201]- 
Fc, huOPG [22-401]-Fc. muOPG [22-180]-Fc produced in CHO cells and huOPG met[32-401] produced in E. coli did 
not demonstrate activity in the in vitro assay. 

OPG from several sources was produced as a dimer and to some extent as a higher multimer. Rat OPG [22-401] 
produced in transgenic mice, muOPG [22-401] and huOPG [22-401] produced as a recombinant polypeptide in CHO 

20 cells, and OPG expressed as a naturally occurring product from a cytotoxic T cell line were predominantly dimers and 
trimers when analyzed on nonreducing SDS gels (see Example 9). Truncated OPG polypeptides having deletions in 
the region of amino acids 186-401 (e.g., OPG [1-185] and OPG [1-194]) were predominantly monomeric suggesting 
that the region 186-401 may be involved in self -association of OPG polypeptides. However, huOPG met[32-401] pro- 
duced in E. coli was largely monomeric. 

25 OPG may be important in regulating bone resorption. The protein appears to act as a soluble receptor of the TNF ' 

family and may prevent a receptor-ligand interaction involved in the osteolytic pathway. One aspect of the regulation 
appears to be a reduction in the number of osteoclasts. 

Nucleic Acids 

30 

The invention provides for an isolated nucleic acid encoding a polypeptide having at least one of the biological 
activities of OPG. As described herein, the biological activities of OPG include, but are not limited to, any activity 
involving bone metabolism and in particular; include increasing bone density. The nucleic acids of the invention are 
selected from the following: 

35 

a) the nucleic acid sequences as shown in Figures 2B-2C (SEQ ID NO:120) 5 9A-9B (SEQ ID NO:122), and 9C- 
9D (SEQ ID NO: 124) or complementary strands thereof; 

b) the nucleic acids which hybridize under stringent conditions with the polypeptide-encoding region in Figures 
2B-2C (SEQ ID NO:120), 9A-9B (SEQ ID NO:122), and 9C-9D (SEQ ID NO:124); and 

40 c) nucleic acids which hybridize under stringent conditions with nucleotides 148 through 337 inclusive as shown 

in Figure 1 A. 

d) the nucleic acid sequences which are degenerate to the sequences in (a) and (b). 

The invention provides for nucleic acids which encode rat, mouse and human OPG as well as nucleic acid se- 
45 quences hybridizing thereto which encode a polypeptide having at least one of the biological activities of OPG. Also 
provided for are nucleic acids which hybridize to a rat OPG EST encompassing nucleotides 1 48-337 as shown in Figure 
1 A. The conditions for hybridization are generally of high stringency such as 5xSSC, 50% formamide and 42°C de- 
scribed in Example 1 of the specification. Equivalent stringency to these conditions may be readily obtained by adjusting 
salt and organic solvent concentrations and temperature. The nucleic acids in (b) encompass sequences encoding 
50 OPG-related polypeptides which do not undergo detectable hybridization with other known members of the TNF re- 
ceptor super-family. In a preferred embodiment, the nucleic acids are as shown in Figures 2B-2C (SEQ ID NO: 120), 
9A-9B (SEQ I D NO: 1 22), and 9C-9D (SEQ I D NO: 1 24). 

The length of hybridizing nucleic acids of the invention may be variable since hybridization may occur in part or 
all of the polypeptide-encoding regions as shown in Figures 2B-2C (SEQ ID NO:120), 9A-9B (SEQ ID NO:122), and 
55 9C-9D (SEQ ID NO: 124), and may also occur in adjacent noncoding regions. Therefore, hybridizing nucleic acids may 
be truncations or extensions of the sequences shown in Figures (SEQ ID NO:120) 2B-2C , 9A-9B (SEQ ID NO: 122), 
and 9C-9D (SEQ ID NO:124). Truncated or extended nucleic acids are encompassed by the invention provided they 
retain one or more of the biological properties of OPG. The hybridizing nucleic acids may also include adjacent non- 
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coding regions which are 5' and/or 3' to the OPG coding region. The noncoding regions include regulatory regions 
involved in OPG expression, such as promoters, enhance, translational initiation sites, transcription termination sites 
and the like. 

Hybridization conditions for nucleic acids are described in Sambrook et al. Molecular Cloning: A Laboratory Manual , 

5 2nd ed. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York (1989) 

DNA encoding rat OPG was provided in plasmid pMO-B1 .1 deposited with the American Type Culture Collection, 
Rockville, MD on December 27, 1 995 under ATCC accession no. 69970. DNA encoding mouse OPG was provided in 
plasmid pRcCMV-murine OPG deposited with the American Type Culture Collection, Rockville, MD on December 27, 
1995 under accession no. 69971. DNA encoding human OPG was provided in plasmid pRcCMV - human OPG de- 

10 posited with the American Type Culture Collection, Rockville, MD on December 27, 1995 under accession no. 69969. 
The nucleic acids of the invention will hybridize under stringent conditions to the DNA inserts of ATCC accession nos. 
69969, 69970, and 69971 and have at least one of the biological activities of OPG. 

Also provided by the invention are derivatives of the nucleic acid sequences as shown in Figures 2B, 9A and 9B. 
As used herein, derivatives include nucleic acid sequences having addition, substitution, insertion or deletion of one 

is or more residues such that the resulting sequences encode polypeptides having one or more amino acid residues 
which have been added, deleted, inserted or substituted and the resulting polypeptide has the activity of OPG. The 
nucleic acid derivatives may be naturally occurring, such as by splice variation or polymorphism, or may be constructed 
using site-directed mutagenesis techniques available to the skilled worker. One example of a naturally occurring variant 
of OPG is a nucleic acid encoding a lys to asn change at residue 3 within the leader sequence (see Example 5). It is 

20 anticipated that nucleic acid derivatives will encode amino acid changes in regions of the molecule which are least 
likely to disrupt biological activity. Other derivatives include a nucleic acid encoding a membrane-bound form of OPG 
having an extracellular domain as shown in Figures 2B-2C (SEQ ID NO:120), 9A-9B (SEQ ID NO:122), and 9C-9D 
(SEQ ID NO:124) along with transmembrane and cytoplasmic domains. 

In one embodiment, derivatives of OPG include nucleic acids encoding truncated forms of OPG having one or 

25 more amino acids deleted from the carboxy terminus. Nucleic acids encoding OPG may have from 1 to 216 amino * 
acids deleted from the carboxy terminus. Optionally, an antibody Fc region may extend from the new carboxy terminus 
to yield a biologically active OPG-Fc fusion polypeptide, (see Example 11). In preferred embodiments, nucleic acids 
encode OPG having the amino acid sequence from residues 22-185, 22-189, 22-1 94 or 22-201 (using numbering in 
Figure 9E-F) and optionally, encoding an Fc region of human IgG. 

30 Also included are nucleic acids encoding truncated forms of OPG having one or more amino acids deleted from 

the amino terminus. Truncated forms include those lacking part or all the 21 amino acids comprising the leader se- 
quence. Additionally the invention provides for nucleic acids encoding OPG having from 1 to 10 amino acids deleted 
from the mature amino terminus (at residue 22) and .optionally, having from 1 to 216 amino acids deleted from the 
carboxy terminus (at residue 401 ). Optionally, the nucleic acids may encode a methionine residue at the amino terminus: 

35 Examples of such OPG truncated polypeptides are described in Example 8. 

: Examples of the nucleic acids of the invention include cDNA, genomic DNA, synthetic DNA and RNA. cDNA is 
obtained from libraries prepared from mRNA isolated from various tissues expressing OPG. In humans, tissue sources 
for OPG include kidney, liver, placenta and heart. Genomic DNA encoding OPG is obtained from genomic libraries 
which are commercially available from a variety of species. Synthetic DNA is obtained by chemical synthesis of over- 

40 lapping oligonucleotide fragments followed by assembly of the fragments to reconstitute part or all of the coding region 
and flanking sequences (see U.S. Patent No. 4,695,623 describing the chemical synthesis of interferon genes). RNA 
is obtained most easily by procaryotic expression vectors which direct high-level synthesis of mRNA, such as vectors 
using T7 promoters and RNA polymerase. 

Nucleic acid sequences of the invention are used for the detection of OPG sequences in biological samples in 

45 order to determine which cells and tissues are expressing OPG mRNA. The sequences may also be used to screen 
cDNA and genomic libraries for sequences related to OPG. Such screening is well within the capabilities of one skilled 
in the art using appropriate hybridization conditions to detect homologus sequences. The nucleic acids are also useful 
for modulating the expression of OPG levels by anti-sense therapy or gene therapy. The nucleic acids are also used 
for the development of transgenic animals which may be used for the production of the polypeptide and for the study 

50 of biological activity (see Example 3). 

Vectors and Host Cells 

Expression vectors containing nucleic acid sequences encoding OPG, host cells transformed with said vectors 
55 and methods for the production of OPG are also provided by the invention. An overview of expression of recombinant 
proteins is found in Methods of Enzvmoloqy v. 185, Goeddel, D.V. ed. Academic Press (1990). 

Host cells for the production of OPG include procaryotic host cells, such as E. coli, yeast, plant, insect and mam- 
malian host cells. E. constrains such as HB101 or JM101 are suitable for expression. Preferred mammalian host cells 
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include COS CHOd- 293 CV-1 , 3T3, baby hamster kidney (BHK) cells and others. Mammalian host cells are preferred 
when post-translational modifications, such as glycosy lation and polypeptide processing, are important for OPG activity 
Mammalian expression allows for the production of secreted polypeptides which may be recovered from the growth 

medium. , , 

5 Vectors for the expression of OPG contain at a minimum sequences required for vector propogation and for ex- 

pression of the cloned insert. These sequences include a replication origin, selection marker, promoter, ribosome bind- 
ing site enhancer sequences, RNA splice sites and transcription termination site. Vectors suitable for expression in 
the aforementioned host cells are readily available and the nucleic acids of the invention are inserted into the vectors 
using standard recombinant DNA techniques. Vectors for tissue-specific expression of OPG are also included. Such 

io vectors include promoters which function specifically in liver, kidney or other organs for production in mice, and viral 
vectors for the expression of OPG in targeted human cells. 

Using an appropriate host-vector system, OPG is produced recombinants by culturing a host cell transformed with 
an expression vector containing nucleic acid sequences encoding OPG under conditions such that OPG is produced, 
and isolating the product of expression. OPG is produced in the supernatant of transfected mammalian cells or in 

is inclusion bodies of transformed bacterial host cells. OPG so produced may be purified by procedures known to one 
skilled in fhe art as described below. The expression of OPG in mammalian and bacterial host systems is described 
in Examples 7 and 8 Expression vectors for mammalian hosts are exemplified by plasmids such as pDSRa described 
in PCT Application No 90/1 4363. Expression vectors for bacterial host cells are exemplified by plasmids pAMG21 and 
P AMG22-His described in Example 8. Plasmid pAMG21 was deposited with the American Type Culture Collection, 

20 Rockville MD on July 24, 1 996 under accession no. 9811 3. Plasmid pAMG22-His was deposited with the American 
Type Culture Collection, Rockville, MD on July 24, 1996 under accession no. 98112. It is anticipated that the specific 
plasmids and host cells described are for illustrative purposes and that other available plasmids and host cells could 
also be used to express the polypeptides. 

The invention also provides for expression of OPG from endogenous nucleic acids by in vivo or ex yjvo recombi- 

2S nation events to allow modulation of OPG from the host chromosome. Expression of OPG by the introduction of ex- • 
ogenous regulatory sequences (e.g. promoters or enhancers) capable of directing the production of OPG from endog- 
enous OPG coding regions is also encompassed. Stimulation of endogenous regulatory sequences capable of directing 
OPG production (e.g. by exposure to transcriptional enhancing factors) is also provided by the invention. 

30 Polypeptides 

The invention provides for OPG, a novel member of the TNF receptor superfamily, having an activity associated 
with bone metabolism and in particular having the activity of inhibiting bone resorption thereby increasing bone density 
OPG refers to a poVpeptide having an amino acid sequence of mouse, rat or human OPG or a derivative thereof 

35 havinq at least one of the biological activities of OPG. The amino acid sequences of rat, mouse and human OPG are 
shown in Figures 2B-2C (SEQ ID NO:121), 9A-9B (SEQ ID NO:123). and 9C-9D (SEQ ID NO:125) respectively. A 
derivative of OPG refers to a polypeptide having an addition, deletion, insertion or substitution of one or more amino 
acids such that the resulting polypeptide has at least one of the biological activities of OPG. The biolog.ca activities 
of OPG include, but are not limited to, activities involving bone metabolism. Preferably, the polypeptides will have the 

40 amino terminal leader sequence of 21 amino acids removed. 

OPG polypeptides encompassed by the invention include rat [1-401], rat [22-180], rat [22-401], rat [22-401 ]-Fc 
fusion rat [1-180]-Fc fusion, mouse[1-401], mouse [1-180], mouse [22-401], human [1-401], mouse [22-180], human 
f22-401l human [22-180], human [1-180], human [22-180]-Fc fusion and human met-32-401 . Amino acid numbering 
is as shown in SEQ ID NO:121 (rat), SEQ ID NO:123 (mouse) and SEQ ID NO:125 (human). Also encompassed are 

45 polypeptide derivatives having deletions or carboxy-terminal truncations of part or all of amino acids residues 1 80-401 
of OPG- one or more amino acid changes in residues 180-401 ; deletion of part or all of a cysteine-rich domain of OPG, 
in particular deletion of the distal (carboxy-terminal) cysteine-rich domain; and one or more amino acid changes in a 
cysteine-rich domain, in particular in the distal (carboxy-terminal) cysteine-rich domain. In one embodiment, OPG has 
from 1 to about 21 6 amino acids deleted from the carboxy terminus. In another embodiment, OPG has from 1 to about 

so 10 amino acids deleted from the mature amino terminus (wherein the mature amino terminus is at residue 22) and, 
optionally has from 1 to about 216 amino acids deleted from the carboxy terminus. 

Additional OPG polypeptides encompassed by the invention include the following: human [22-180]-Fc fusion, hu- 
man [22-201 ]-Fc fusion, human [22-401]-Fc fusion, mouse [22-185]-Fc fusion, mouse [22-194]-Fc fusion. These 
polypeptides are produced in mammalian host cells, such as CHO or 293 cells, Additional OPG polypeptides encom- 

ss passed by the invention which are expressed in procaryotic host cells include the following: human met[22-401 ], Fc- 
human met[22-401] fusion (Fc region is fused at the amino terminus of the full-length OPG coding sequence as de- 
scribed in Example 8), human met[22-401]-Fc fusion (Fc region fused to the full-lengh OPG sequence), Fc-mouse met 
[22-401] fusion mouse met[22-401]-Fc fusion, human met[27-401], human met[22-185], human met[22-189], human 
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met[22-194], human met[22-194] (P25A), human met {22-1 94] (P26A), human met[27-185], human met[27-189], hu- 
man met[27-194], human met-arg-gly-ser-(his) 6 [22-401], human met-lys [22-401], human met-(lys) 3 -[22-401] s human 
met[22-401]-Fc (P25A), human met[22-401] (P25A), human met[22-401] (P26A). human met[22-401] (P26D), mouse 
met[22-401], mouse met[27-401], mouse met[32-401], mouse met[27-180], mouse met[22-l89], mouse met[22-194], 

5 mouse met[27-189], mouse met[27-194], mouse met-lys[22-401], mouse HEK[22-401](A45T), mouse met-lys-(his)7 
[22-401], mouse met-lys[22-401]-(his)7 and mouse met[27-401] (P33E, G36S, A45P). It is understood that the above 
OPG polypeptides produced in procaryotic host cells have an amino-terminal methionine residue, it such a residue is 
not indicated. In specific examples, OPG-Fc fusion were produced using a 227 amino acid region of human IgGl^yl 
was used having the sequence as shown in Ellison*et al. (Nuc. Acids Res. JO, 4071-4079 (1982)). However, variants 

10 of the Fc region of human IgG may also be used. 

Analysis of the biological activity of carboxy-terminal OPG truncations fused to the human lgG1 Fc region indicates 
a portion of OPG of about 1 64 amino acids which is required for activity. This region encompasses amino acids 22-1 85, 
preferably those in Figure 9C-9D (SEQ ID NO: 125), and comprises four cysteine-rich domains characteristic of the 
cysteine-rich domains of TNFR extraceullular domains. 

is Using the homology between OPG and the extracellular ligand binding domains of TNF receptor family members, 

a three-dimensional model of OPG was generated based upon the known crystal structure of the extracellular domain 
of TNFR-I (see Example 6). This model was used to identify those residues within OPG which may be important for 
biological activity Cysteine residues that are involved in maintaining the structure of the four cysteine-rich domains 
were identified. The following disulfide bonds were identified in the model: Domain 1 : cys41 to cys54, cys44 to cys62, 

20 tyr23 and his 66 may act to stabilize the structure of this domain; Domain 2: cys65 to cys80, cys83 to cys98, cys87 to 
cyslOS; Domain 3: cys!07 tocyslB, cys124 tocys142; Domain 4: cys145tocys!60, cys166to cys185. Residues were 
also identified which were in close proximity to TNFp as shown in Figures 11 and 12A-12B. In this model, it is assumed 
that OPG binds to a corresponding ligand; TNF0 was used as a model ligand to simulate the interaction of OPG with 
its ligand. Based upon this modeling, the following residues in OPG may be important for ligand binding: glu34, Iys43, 

25 P ro66 to gln91 (in particular, pro66, his68 s tyr69, tyr70, thr71, asp72, ser73, his76, ser77, asp78, glu79, Ieu81, tyr82, * 
pro85, yal86, Iys88, glu90 and g1n91), glu153 and ser155. 

Alterations in these amino acid residues, either singly or in combination, may alter the biological activity of OPG. 
For example, changes in specific cysteine residues may alter the structure of individual cysteine-rich domains, whereas 
changes in residues important for ligand binding may affect physical interactions of OPG with ligand. Structural models 

30 can aid in identifying analogs which have more desirable properties, such as enhanced biological activity, greater 
stability, or greater ease of formulation. 

The invention also provides for an OPG multimer comprising OPG monomers. OPG appears to be active as a 
muitimer (e.g. dimer, trimer of a higher number of monomers). Preferably, OPG multimers are dimers or trimers. OPG 
multimers may comprise monomers having the amino acid sequence of OPG sufficient to promote multimer formation 

35 or may comprise monomers having heterologous sequences such as an antibody Fc region. Analysis of carboxy- 
terminal deletions of OPG suggest that at least a portion of the region 186-401 is involved in association of OPG 
polypeptides. Substitution of part or all of the region of OPG amino acids 1 86-401 with an amino acid sequence capable 
of self-association is also encompassed by the invention. Alternatively, OPG polypeptides or derivatives thereof may 
be modified to form dimers or multimers by site directed mutagenesis to create unpaired cysteine residues for interchain 

40 disulfide bond romation, by photochemical crosslinking, such as exposure to ultraviolet light, or by chemical crosslinking 
with Afunctional linker molecules such as bifunctional polyethylene glycol and the like. 

Modifications of OPG polypeptides are encompassed by the invention and include post-translational modifications 
(e.g., N-linked or O-linked carbohydrate chains, processing of N-terminal or C-terminal ends), attachment of chemical 
moieties to the amino acid backbone, chemical modifications of N-linked or O-linked carbohydrate chains, and addition 

45 of an N-terminal methionine residue as a result of procaryotic host cell expression. The polypeptides may also be 
modified with a detectable label, such as an enzymatic, fluorescent, isotopic or affinity label to allow for detection and 
isolation of the protein. 

Further modifications of OPG include chimeric proteins wherein OPG is fused to a heterologous amino acid se- 
quence. The heterologous sequence may be any sequence which allows the resulting fusion protein to retain the 
50 activity of OPG. The heterologous sequences include for example, immunoglobulin fusions, such as Fc fusions, which 
may aid in purification of the protein. A heterologous sequence which promotes association of OPG monomers to form 
dimers, trimers and other higher multimeric forms is preferred. 

The polypeptides of the invention are isolated and purified from other polypeptides present in tissues, cell lines 
and transformed host cells expressing OPG, or purified from components in cell cultures containing the secreted pro- 
55 tein. In one embodiment, the polypeptide is free from association with other human proteins, such as the expression 
product of a bacterial host cell. 

Also provided by the invention are chemically modified derivatives of OPG which may provide. additional advan- 
tages such as increasing stability and circulating time of the polypeptide, or decreasing immunogenicity (see U.S. 
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Patent No 4 179 337). The chemical moieties for derivitization may be selected from water soluble polymers such as 
polyethylene glycol, ethylene glycol/propylene glycol copolymers, carboxymethylcellulose, dextran. polyvinyl alcohol 
and the like. The polypeptides may be modified at random positions within the molecule, or at predetermined posrt.ons 
within the molecule and may include one, two, three or more attached chemical moieties. 

s The polymer may be of any molecular weight, and may be branched or unbranched. For polyethylene glycol, the 

preferred molecular weight is between about IkDa and about 1 0OkDa (the term "about" indicating that in preparations 
of polyethylene glycol, some molecules will weigh more, some less, than the stated molecular weight) for ease in 
handling and manufacturing. Other sizes may be used, depending on the desired therapeutic profile (e.g., the durat.on 
of sustained release desired, the effects, if any on biological activity, the ease in handling, the degree or lack of anti- 

io qenicity and other known effects of the polyethylene glycol to a therapeutic protein or analog). 

The polyethylene glycol molecules (or other chemical moieties) should be attached to the protein with consideration 
of effects on functional or antigenic domains of the protein. There are a number of attachment methods availab e to 
those skilled in the art, e.g. EP 0 401 384 herein incorporated by reference (coupling PEG to G-CSF), see also Malik 
et al Exp Hematol 2Q- 1028-1035 (1992) (reporting pegyiation of GM-CSF using tresyl chloride). For example, pol- 

15 yethylene glycol may be covalently bound through amino acid residues via a reactive group, such as, a free amino or 
carboxyl group Reactive groups are those to which an activated polyethylene glycol molecule may be bouno\ The 
amino acid residues having a free amino group may include lysine residues and the N-terminal amino acid residues; 
those having a free carboxyl group may include aspartic acid residues glutamic acid residues and the C-term.nal amino 
acid residue Sulfhydrl groups may also be used as a reactive group for attaching the polyethylene gVcol molecules). 

20 Preferred for therapeutic purposes is attachment at an amino group, such as attachment at the N-term.nus or lysine 

9r ° U One may specifically desire N-terminally chemically modified protein. Using polyethylene glycol as an illustration 
of the present compositions, one may select from a variety of polyethylene glycol molecules (by molecular we.ght, 
branching etc ), the proportion of polyethylene glycol molecules to protein (or peptide) molecules in the reaction mix, 
25 the type of pegyiation reaction to be performed, and the method of obtaining the selected N-terminally pegylated protein^ - 
The method of obtaining the N-terminally pegylated preparation (i.e., separating this moiety from other monopegylaed 
moieties if necessary) may be by purification of the N-terminally pegylated material from a populate of pegylated 
protein molecules. Selective N-terminal chemically modification may be accomplished by reductive alkylat.on which 
exploits differential reactivity of different types of primary amino groups (lysine versus the N-term.nal) available for 
30 derealization in a particular protein. Under the appropriate reaction conditions, substantially selective derivat.zat.on of 
the protein at the N-terminus with a carbonyl group containing polymer is achieved. 

Synthetic OPG dimers may be prepared by various chemical crosslinking procedures. OPG monomers may be 
chemically linked in any fashion that retains or enhances the biological activity of OPG. A variety of chemical crosslmk- 
ers may be used depending upon which properties of the protein dimer are desired. For example, crosslinkers may be 
35 short and relatively rigid or longer and more flexible, may be biologically reversible, and may provide reduced immu- 
nogenicity or longer pharmacokinetic half-life. , u . , c , 0 , 0 n 

In one example, OPG molecules are linked through the amino terminus by a two step synthesis (see Example 12). 
In the first step OPG is chemically modified at the amino terminus to introduce a protected thiol, which after purification 
is deprotected and used as a point of attachment for site-specific conjugation through a variety of crosslinkers with a 
second OPG molecule. Amino-terminal crosslinks include, but are not limited to, a disulfide bond, thioether linkages 
using short-chain, bis-functional aliphatic crosslinkers, and thioether linkages to variable length, Afunctional polyeth- 
ylene glycol crosslinkers (PEG "dumbbells"). Also encompassed by PEG dumbbell synthesis of OPG dimers is a by- 
product of such synthesis, termed a "monobell". An OPG monobell consists of a monomer coupled to a linear bifunc- 
tional PEG with a free polymer terminus: Alternatively, OPG may be crosslinked directly through a variety of amine 
specific homobifunctional crosslinking techniques which include reagents such as: diethylenetnaminepentaacetic di- 
anhydride (DTPA), p-benzoquinone (pBQ) or bis(sulfosuccinimidyl) suberate (BS3) as well as others known in the art 
It is also possible to thiolate OPG directly with reagents such as iminothiolane in the presence of a variety of Afunctional, 
thiol specific crosslinkers, such as PEG bismaleimide, and achieve dimerization and/or dumbbells in aone step process. 
A method for the purification of OPG from natural sources and from transfected host cells is also included. The 
so purification process may employ one or more standard protein purification steps in an appropriate order to obtain 
purified protein. The chromatography steps can include ion exchange, gel filtration, hydrophobic interaction, reverse 
phase, chromatofocusing, affinity chromatography employing an anti-OPG antibody or biotin-streptavidin affinity com- 
plex and the like. 



40 
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55 Antibodies 



Also encompassed by the invention are antibodies specifically binding to OPG. Antigens for the generation of 
antibodies may be full-length polypeptides or peptides spanning a portion of the OPG sequence. Immunological pro- 
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cedures for the generation of polyclonal or monoclonal antibodies reactive with OPG are known to one skilled in the 
art (see, for example, Harlow and Lane, Antibodies: A Laboratory Manual Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor N.Y. (1988)). Antibodies so produced are characterized for binding specificity and epitope recognition 
using standard enzyme-linked immunosorbent assays. Antibodies also include chimeric antibodies having variable 
5 and constant domain regions derived from different species. In one embodiment, the chimeric antibodies are humanized 
antibodies having murine variable domains and human constant domains. Also encompassed are complementary 
determining regions grafted to a human framework (so-called CDR-grafted antibodies). Chimeric and CDR-grafted 
antibodies are made by recombinant methods known to one skilled in the art. Also encompassed are human antibodies 
made in mice. 

10 Anti-OPG antibodies of the invention may be used as an affinity reagent to purify OPG from biological samples 

(see Example 1 0). In one method, the antibody is immobilized on CnBr-activated Sepharose and a column of antibody- 
Sepharose conjugate is used to remove OPG from liquid samples. Antibodies are also used as diagnostic reagents to 
detect and quantitate OPG in biological samples by methods described below. 

15 Pharmaceutical compositions 

The invention also provides for pharmaceutical compositions comprising a therapeutically effective amount of the 
polypeptide of the invention together with a pharmaceutical^ acceptable diluent, carrier, solubilizer, emulsifier, pre- 
servative and/or adjuvant. The term "therapeutically effective amount" means an amount which provides a therapeutic 
20 effect for a specified condition and route of administration. The composition may be in a liquid or lyophilized form and 
comprises a diluent (Tris, acetate or phosphate buffers) having various pH values and ionic strengths, solubilizer such 
as Tween or Polysorbate, carriers such as human serum albumin or gelatin, preservatives such as thimerosal or benzyl 
alcohol, and antioxidants such as ascrobic acid or sodium metabisulfite. Also encompassed are compositions com- 
prising OPG modified with water soluble polymers to increase solubility or stability. Compositions may also comprise 
25 incorporation of OPG into liposomes, microemulsions, micelles or vesicles for controlled delivery over an extended * 
period of time. Specifically, OPG compositions may comprise incorporation into polymer matricies such as hydrogels, 
silicones, polyethylenes, ethylene-vinyl acetate copolymers, or biodegradable polymers. Examples of hydrogels in- 
clude polyhydroxyalkylmethacrylates (p-HEMA), polyacrylamide, poly methacry lam ide, polyvinylpyrrolidone, polyvinyl 
alcohol and various polyelectrolyte complexes. Examples of biodegradable polymers include polylactic acid (PLA), 
30 polyglycolicacid (PGA), copolymers of PLA and PGA, polyamides and copolymers of polyamides and polyesters. Other 
controlled re I ease formulations include microcapsules, microspheres, macromolecular complexes and polymeric beads 
which may be administered by injection. 

Selection of a particular composition will depend upon a number of factors, including the condition being treated, 
the route of administration and the pharmacokinetic parameters desired. A more extensive survey of component suit- 
es able for pharmaceutical compositions is found in Remington's Pharmaceutical Sciences , 18th ed. A.R. Gennaro, ed. 
Mack, Easton, PA (1980). 

Compositions of the invention may be administered by injection, either subcutaneous, intravenous or intramuscular, 
or by oral, nasal, pulmonary or rectal administration. The route of administration eventually chosen will depend upon 
a number of factors and may be ascertained by one skilled in the art. 
40 The invention also provides for pharmaceutical compositions comprising a therapeutically effective amount of the 

nucleic acids of the invention together with a pharmaceutical^ acceptable adjuvant. Nucleic acid compositions will be 
suitable for the delivery of part or all of the OPG coding region to cells and tissues as part of an anti-sense or gene 
therapy regimen. 

45 Methods of Treatment 

Bone tissue provides support for the body and consists of mineral (largely calcium and phosphorous), a matrix of 
collagenous and noncollagenous proteins, and cells. Three types of cells found in bone, osteocytes, osteoblasts and 
osteoclasts, are involved in the dynamic process by which bone is continually formed and resorbed. Osteoblasts pro- 

50 mote formation of bone tissue whereas osteoclasts are associated with resorption. Resorption, or the dissolution of 
bone matrix and mineral, is a fast and efficient process compared to bone formation and can release large amounts 
of mineral from bone. Osteoclasts are involved in the regulation of the normal remodeling of skeletal tissue and in 
resorption induced by hormones. For instance, resorption is stimulated by the secretion of parathyroid hormone in 
response to decreasing concentrations of calcium ion in extracellular fluids. In contrast, inhibition of resorption is the 

55 principal function of calcitonin. In addition, metabolites of vitamin D alter the responsiveness of bone to parathyroid 
hormone and calcitonin. 

After skeletal maturity, the amount of bone in the skeleton reflects the balance (or imbalance) of bone formation 
and bone resorption. Peak bone mass occurs after skeletal maturity prior to the fourth decade. Between the fourth and 
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fifth decades the equilibrium shifts and bone resorption dominates. The inevitable decrease in bone mass with ad- 
vancing years starts earlier in females than males and is distinctly accelerated after menopause in some females 
(principally those of Caucasian and Asian descent). 

Osteopenia is a condition relating generally to any decrease in bone mass to below normal levels. Such a condition 

5 may arise from a decrease in the rate of bone synthesis or an increase in the rate of bone destruction or both. The 
most common form of osteopenia is primary osteoporosis, also referred to as postmenopausal and senile osteoporosis. 
This form of osteoporosis is a consequence of the universal loss of bone with age and is usually a result of increase 
in bone resorption with a normal rate of bone formation. About 25 to 30 percent of all white females in the United States 
develop symptomatic osteoporosis. A direct relationship exists between osteoporosis and the incidence of hip, femoral, 

10 neck and inter-trochanteric fracture in women 45 years and older. Elderly males develop symptomatic osteoporosis 
between the ages of 50 and 70, but the disease primarily affects females. 

The cause of postmenopausal and senile osteoporosis is unknown. Several factors have been identified which 
may contribute to the condition. They include alteration in hormone levels accompanying aging and inadequate calcium 
consumption attributed to decreased intestinal absorption of calcium and other minerals. Treatments have usually 

is included hormone therapy or dietary supplements in an attempt to retard the process. To date, however, an effective 
treatment for bone loss does not exist. 

The invention provides for a method of treating a bone disorder using a therapeutically effective amount of OPG. 
The bone disorder may be any disorder characterized by a net bone loss (osteopenia or osteolysis). In general, treat- 
ment with OPG is anticipated when it is necessary to suppress the rate of bone resorption. Thus treatment may be 

20 done to reduce the rate of bone resorption where the resorption rate is above normal or to reduce bone resorption to 
below normal levels in order to compensate for below normal levels of bone formation. 
Conditions which are treatable with OPG include the following: 

Osteoporosis, such as primary osteoporosis, endocrine osteoporosis (hyperthyroidism, hyperparathyroidism, 
2S Cushing's syndrome, and acromegaly), hereditary and congenita! forms of osteoporosis (osteogenesis imperfecta, 

homocystinuria, Menkes' syndrome, and Riley-Day syndrome) and osteoporosis due to immobilization of extrem- 
ities. 

Paget's disease of bone (osteitis deformans) in adults and juveniles 
Osteomyelitis, or an infectious lesion in bone, leading to bone loss. 
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Hypercalcemia resulting from solid tumors (breast, lung and kidney) and hematologic malignacies (multiple mye- 
loma, lymphoma and leukemia), idiopathic hypercalcemia, and hypercalcemia associated with hyperthryoidism and 
renal function disorders. 

Osteopenia following surgery, induced by steroid administration, and associated with disorders of the small ana 
35 large intestine and with chronic hepatic and renal- diseases. 

Osteonecrosis, or bone cell death, associated with traumatic injury or nontraumatic necrosis associated with Gau- 
cher's disease, sickle cell anemia, systemic lupus erythematosus and other conditions. 
Bone loss due to rheumatoid arthritis. 
Periodontal bone loss. 
40 Osteolytic metastasis 

It is understood that OPG may be used alone or in conjunction with other factors for the treatment of bone disorders. 
In one embodiment, osteop rote ge rein is used in conjunction with a therapeutically effective amount of a factor which 
stimulates bone formation. Such factors include but are not limited to the bone morphogenic factors designated BMP- 
1 through BMP-12, transforming growth factor-^ (TGF-p) and TGF-p family members, interleukin-1 inhibitors, TNFa 
45 inhibitors, parathyroid hormone and analogs thereof, parathyroid related protein and analogs thereof, E series pros- 
taglandins, bisphosphonates (such as alendronate and others), and bone-enhancing minerals such as fluoride and 
calcium. 

The following examples are offered to more fully illustrate the invention, but are not construed as limiting the scope 
thereof. 
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EXAMPLE 1 

Identification and isolation of the rat OPG cDNA 



55 Materials and methods for cDNA cloning and analysis are described in Maniatis et al, ibid. Polymerase chain 

reactions (PCR) were performed using a Perkin-Elmer 9600 thermocycler using PCR reaction mixture (Boeh ringer- 
Mannheim) and primer concentrations specified by the manufacturer. In general, 25-50 uJ reactions were denatured 
at 94°C followed by 20-40 cycles of 94°C for 5 seconds, 50-60°C for 5 seconds, and 72°C for 3-5 minutes. Reactions 
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were the treated for 72 °C for 3-5 minutes. Reactions were then analyzed by gel electrophoresis as described in 
Maniatis et al., ibid . 

A cDNA library was constructed using mRNA isolated from embryonic d20 intestine for EST analysis (Adams et 
al. Science 252, 1651-1656 (1991)). Rat embryos were dissected, and the entire developing small and large intestine 

s removed and washed in PBS. Total cell RN A was purified by acid guanidinium thiocyanate-phenol-chlorof orm extraction 
(Chomczynski and SacchiAnal. Biochem. 162, 156-159, (1987)). The poly (A+) mRNA fraction was obtained from the 
total RNA preparation by adsorption to, and elution from, Dynabeads Oligo (oT)25 (Dynal Corp) using the manufac- 
turers recommended procedures. A random primed cDNA library was prepared using the Superscript Plasmid System 
(Gibco BRL, Gaithersburg, Md). The random cDNA primer containing an internal Not I restriction site was used to 

io initiate first strand synthesis and had the following sequence: 



,S^AAA^AAnnAAAAAA GCGGCCGC TACANNNNNNNNT-3 / (SEQ ID NO: 1) 



• For the first strand synthesis three separate reactions were assembled that contained 2.5 u.g of poly(A) RNA and 
120 ng, 360 ng or 1,080 ng of random primer. After second strand synthesis, the reaction products were separately 
extracted with a mixture of phenol:choroform:isoamyl alcohol (25:24: 1 ratio), and then ethanol precipitated. The double 
20 strand (ds) cDNA products of the three reactions were combined and ligated to the following ds oligonucleotide adapter: 

5' -TCGACCCACGCGTCCG-3' (SEQ ID NO: 2) 
25 3' -GGGTGCGCAGGCp-5 ' (SEQ ID NO: 3) 

After ligation the cDNA was digested to completion with Not I, extracted with phenol:chloroform:isoamyl (25:24:1 ) 
alcohol and ethanol precipitated. The resuspended cDNA was then size fractionated by gel filtration using premade 

30 columns provided with the Superscript Plasmid System (Gibco BRL, Gaithersburg, Md) as recommended by the man- 
ufacturer. The two fractions containing the largest cDNA products were pooled, ethanol precipitated and then direc- 
tionally ligated into Not I and Sal I digested pMOB vector DNA (Strathmann et al, 1991). The ligated cDNA was intro- 
duced into competent ElectroMAX DH10B E. coli (Gibco BRL, Gaithersburg, MD) by electroporation. For automated 
sequence analysis approximately 1 0,000 transformants were plated on 20cm x 20cm agar plates containing ampicillin 

35 supplemented LB nutrient media. The colonies that arose were picked and arrayed onto 96 well microtiter plates con- 
taining 200 ml of L-broth, 7.5% glycerol, and 50 ng/ml ampicillin. The cultures were grown overnight at 37°C, a duplicate 
set of microtiter plates were made using a sterile 96 pin replicating tool, then both sets were stored at -80°C for further 
analysis. For full-length cDNA cloning approximately one million transformants were plated on 96 bacterial ampicillin 
plates containing about 1 0,000 clones each. The plasmid DNA from each pool was separately isolated using the Qiagen 

40 Plasmid Maxi Kit (Qiagen Corp. .Germany) and arrayed into 96 microtiter plates for PCR analyses. 

To sequence random fetal rat intestine cDNA clones, glycerol stocks were thawed, and small aliquots diluted 1: 
25 in distilled. Approximately 3.0 ul of diluted bacterial cultures were added to PCR reaction mixture (Boehringer- 
Mannheim) containing the following oligonucleotides: 

45 

5'-TGTAAAACGACGGCCAGT-3' (SEQ ID NO: 4) 
5' -CAGGAAACAGCTATGACC-3' (SEQ ID NO: 5) 

so 

The reactions were incubated in a thermocycler (Perkin-Elmer 9600) with the following cycle conditions: 94 C for 2 
minutes; 30 cycles of 94°C for 5 seconds, 50°C for 5 seconds, and 72°C for 3 minutes.; 72°C for 4 minutes. After 
incubation in the thermocycler, the reactions were diluted with 2.0 mL of water. The amplified DNA fragments were 
further purified using Centricon columns (Princeton Separations) using the manufacturer's recommended procedures. 
55 The PCR reaction products were sequenced on an Applied Biosystems 373A automated DNA sequencer using T3 
primer (oligonucleotide 353-23; 5'-CAATTAACCCTCACTAAAGG-3') (SEQ ID NO:6). Taq dye-terminator reactions (Ap- 
plied Biosystems) following the manufacturer's recommended procedures. 

The resulting 5" nucleotide sequence obtained from randomly picked cDNA clones translated and then compared 
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to the existing database of known protein sequences using a modified version of the FASTA program (Pearson et al. 
Math. Enzymol. 183, (1990)). Translated sequences were also analysed for the presence of a spec ^ qjatene^* 
protein motif founTTn all known members of the tumor necrosis factor receptor (TNFR) supertam J ^ 
76. 959-962 (1994)), using the sequence profile method of Gribskov et al. (Proc. Natl. Acad. Sc.. USA83, 4355-4359 
(1987)), as modified by Luethy et al. (Protein Science 3, 139-146 (1994)). „^ eiK , Q nmu 

Using the FASTA and Profile search data, an EST, FRI-1 (Fetal Rat lntest.ne-1 ), was identified as a possible new 
member of the TNFR superfamiV- FRI-1 contained an approximately 600 bp insert with a LORF of about 150 ammo 
adds The closest match in the database was the human type II TNFR (TNFR-2). The region compared showed an 
-43% homology between TNFR-2 and FRI-1 over this 1 50 aa LORF. Profile analysis using the first and second cysteme- 
rich repeats of the TNFR superfamily yielded a Z score of -8, indicating that the FRI-1 gene possibly encodes a new 
family member. To deduce the structure of the FRI-1 product, the fetal rat intestine cDNA library was screened for full 
length clones. The following oligonucleotides were derived from the original FRI-1 sequence: 



is 



5 ■ -GCATTATGACCCAGAAACCGGAC-3 ■ (SEQ ID NO: 7) 
5 » -AGGTAGCGCCCTTCCTCACATTC- 3 (SEQ ID NO: 8) 

These primers were used in PCR reactions to screen 96 pools of plasmid DN A, each pool containing plasmid DMA 
from 10 000 independent cDNA clones. Approximately 1 ug of plasmid pool DNA was amplified in a PCR reaction 
mixture (Boehringer-Mannheim) using a Perkin-Elmer 96 well thermal cycler with the following cycle conditions. 2 mm 
at 94-C 1 cycle- 15 sec at 94°C, then 45 sec at 65°C, 30 cycles; 7 min at 65°C, 1 cycle. PCR reaction products were 
analysed by gel electrophoresis. 1 3 out of 96 plasmid DNA pools gave rise to amplified DNA products with the expected' 

re,at DNr f tm?ne m posmve poo. was used to transform competent E.ectroMAX DH10B E. coH (Gibco BRL, Gaithers- 
burq MD) as described above. Approximately 40,000 transformants were plated onto sterile nitrocellulose filters (BA- 
SS Schleicher and Schuell), and then screened by colony hybridization using a 32 P -d C TP labelled version of the PCR 
product obtained above. Filters were prehybridized in 5X SSC, 50% deionized formam.de, 5X Denhardt s solution 
0 5% SDS and 100 ug/ml denatured salmon sperm DNA for 2-4 hours at 42»C. Filters were then hybridized in 5X 
SSC, 50% deionized formamide, 2X Denhardt's solution. 0.1% SDS. 100 ug/ml denatured salmon spe mi DNA and ^-5 
nq/ml of labelled probe for -18 hours at 42°C. The filters were then washed in 2X SSC for 1 0 m.n a t RT, 1 X SSC tor 
10 min at 55°C. and finally in 0.5X SSC for 10-15 min at 55°C. Hybridizing clones were detected following autoradi- 
ography, and then replated onto nitrocellulose filters for secondary screening. Upon secondary screening, a plasmid 
clone (pB1.1) was isolated, then amplified in L-broth media containing 100 ug/ml ampicill.n and the plasmid DNA 
obtained. Both strands of the 2.4 kbpB1.1 insert were sequenced. 

The dB1 1 insert sequence was used for a FASTA search of the public database to detect any existing sequence 
matches and/or similarities. No matches to any known genes or ESTs were found, although there was an approximate 
45% similarity to the human and mouse TNFR-2 genes. A methionine start codon is found at bp 124 of the nucleot de 
sequence, followed by a LORF encoding 401 aa residues that terminates at bp 1327. The 401 aa residue product is 
predicted to have a hydrophobic signal peptide of approximately 31 residues at its N-term.nus, and 4 potential sites of 
N-linked glycosylation. No hydrophobic transmembrane spanning sequence was identified us.ngthe PepPlot program 
(Wisconsin GCG package, version 8.1 ). The deduced 401 aa sequence was then used to search the protein database. 
45 Again there were no existing matches, although there appeared to be a strong similarity to many members of the 
TNFRsuperfamily, most notably the human and mouse TNFR-2. A sequence alignment of this novel protein with known 
members lot the TNFR-superfamilywaspreparedusingthe Pileup program, andthen modified by PrettyPlot (Wisconsin 
GCG package version 8.1). This alignment shows a clear homology between the full length FRI-1 gene product and 
all other TNFR family members. The homologus region maps to the extracellular domain of TNFR family members, 
so and corresponds to the three or four cysteine-rich repeats found in the ligand binding domain of these proteins. This 
suggested that the FRI-1 gene encoded a novel TNFR family member. Since no transmembrane spanning region was 
detected we predicted that this may be a secreted receptor, similar to TNFR-1 derived soluble receptors (Kohno et a . 
Proc. Natl. Acad. Sci. USA 87, 8331 -8335 (1 990)). Due to the apparent biological activity of the FRI-1 gene (vide infra), 
the product was named Osteoprotegerin (OPG). 



20 



25 



30 



35 



40 



SS 



14 



BNSDOCID: <EP Q784093A1_I_> 



EP 0 784 093 A1 

EXAMPLE 2 

OPG mRNA Expression Patterns in Tissues . 

s Multiple human tissue northern blots (Clonetech) were probed with a 32 P-dCTP labelled FRI-1 PCR product to 

detect the size of the human transcript and to determine patterns of expression. Northern blots were prehybridized in 
5X SSPE, 50% formamide, 5X Denhardt's solution, 0.5% SDS, and 100 |ig/m! denatured salmon sperm DNA for 2-4 
hr at 42°C. The blots were then hybridized in 5X SSPE, 50% formamide, 2X Denhardt's solution, 0.1% SDS, 100 \igf 
ml denatured salmon sperm DNA, and 5 ng/ml labelled probe for 18-24 hr at 42°C. The blots were then washed in 2X 

10 SSC for 10 min at RT, 1X SSC for 10 min at 50°C, then in 0.5X SSC for 10-15 min. 

Using a probe derived from the rat gene, a predominant mRNA species with a relative molecular mass of about 
2.4 kb is detected in several tissues, including kidney, liver, placenta, and heart. Highest levels are detected in the 
kidney. A large mRNA species of Mr 4.5 and 7.5 kb was detected in skeletal muscle and pancreas. In human fetal 
tissue, kidney was found to express relatively high levels of the 2.4 kb mRNA. Using a human probe (vide infra), only 

is the 2.4 kb transcript is detected in these same tissues. In addition, relatively high levels of the 2.4 kb transcript was 
detected in the lymph node, thymus, spleen and appendix. The size of the transcript detected by both the rat and 
human Osteosprotegerin gene is almost identical to the length of the rat pBl .1 FRl-1 insert, suggesting it was a full 
length cDNA clone. 

20 EXAMPLE 3 

Systemic delivery of OPG in transgenic mice 

The rat OPG clone pB1.1 was used as template to PCR amplify the coding region for subcloning into an ApoE- 
25 liver specific expression vector (Simonet et al. J. Clin. Invest. 94, 1310-1319 (1994), and PCT Application No. * 
US94/11675 arid co-owned U.S. Serial No. 08/221 ,767. The following 5* and 3' oligonucleotide primers were used for 
PCR amplification, respectively: 

30 5 f -GACTAGTCCCACAATGAACAAGTGGCTGTG-3 ' (SEQ ID NO: 9) 

5 1 -ATAAGAATGCGGCCGCTAAACTATGAAACAGCCCAGTGACCATTC- 3 ■ 

(SEQ ID NO: 10) 

35 

The PCR reaction mixture (Boehringer-Mannheim) was treated as follows: 94°C tor 1 minute, 1 cycle; 94°C for 20 
sec, 62°C for 30 sec, and 74 C for 1 minute,, 25 cycles. Following amplification, the samples were purified over Qiagen 
PCR columns and digested overnight with Spel and Notl restriction enzymes. The digested products were extracted 
and precipitated and subcloned into the ApoE promoter expression vector. Prior to microinjecting the resulting clone, 

40 HE-OPG, it was sequenced to ensure it was mutation -free. 

The HE-OPG plasmid was purified through two rounds of CsCI density gradient centrif ugation. The purified plasmid 
DNA was digested with Xhol and Ase 1, and the 3.6 kb transgene insert was purified by gel electrophoresis. The purified 
fragment was diluted to a stock injection solution of 1 jag/ml in 5 mM Tris, pH 7.4, 0.2 mM EDTA. Single-cell embryos • 
from BDF1 x BDF1-bred mice were injected essentially as described (Brinster et al., Proc. Natl. Acad. ScL USA 82, 

45 4338 (1 985)), except that injection needles were beveled and siliconized before use. Embryos were cultured overnight 
in a C0 2 incubator and 1 5 to 20 2-cell embryos were transferred to the oviducts of pseudopregnant CD1 female mice. 

Following term pregnancy, 49 offspring were obtained from implantation of microinjected embryos. The offspring 
were screened by PCR amplification of the integrated transgene in genomic DNA samples. The target region for am- 
plification was a 369 bp region of the human Apo E intron which was included in the expression vector. The oligos 

50 used for PCR amplification were: 

5'- GCC TCT AGA AAG AGC TGG GAC-3' (SEQ ID NO: 11) 
55 5 ,_ CGC CGT GTT CCA TTT ATG AGC-3 f (SEQ ID NO: 12) 

The conditions for PCR were: 94°C for 2 minute, 1 cycle; 94°C for 1 min, 63°C for 20 sec, and 72°C for 30 sec, 
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30 cycles. Of the 49 original offspring, 9 were identified as PCR positive transgenic founders. 

At 8-10 weeks of age, five transgenic founders (2, 11, 16, 17, and 28) and five controls (1, 12, 15, 18, and 30) 
were sacrificed for necropsy and pathological analysis. Liver was isolated from the remaining 4 founders by partial 
hepatectomy. For partial hepatectomy, the mice were anesthetized and a lobe of liver was surgically removed Total 

5 cellular RNA was isolated from livers of all transgenic founders, and 5 negative control littermates as described (Mc- 
Donald et al Meth. Enzymol. 152, 21 9 (1 987)). Northern blot analysis was performed on these samples to assess the 
level of transgene expression. Approximately 1 0ug of total RNA from each animal liver was resolved by electrophoresis 
denaturing gels (Ogden et al. Meth. Enzymol 152, 61 (1987)), then transferred to HYBOND-N nylon membrane (Am- 
ersham) and probed with 3 2p dCTP-labelled pB1 .1 insert DNA. Hybridization was performed overnight at 42 C in 50 A 

10 Formam'ide 5 x SSPE, 0.5% SDS, 5 x Denhardt's solution, 100 ug/ml denatured salmon sperm DNA and 2-4x10 
cpm of labeled probe/ml of hybridization buffer. Following hybridization, blots were washed twice in 2 x SSC, 0.1% 
SDS at room temperature for 5 min each, and then twice in 0. 1 x SSC, 0. 1 % SDS at 55'C for 5-1 0 min each. Expression 
of the transgene in founder and control littermates was determined following autoradiography. 

The northern blot data indicate that 7 of the transgenic founders express detectable levels of the transgene mRNA 

is (animal#'s2 11 16 17 22,33,and45). The negative control mice and one of the founders (#28) expressed no transgene- 
related mRNA Since OPG is predicted to be a secreted protein, overexpression of transgene mRNA should be a proxy 
for the level of systemically delivered gene product. Of the PCR and northern blot positive mice, animal 2, 17 and 22 
expressed the highest levels of transgene mRNA, and may show more extensive biological effects on host cells and 
tissues. 
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EXAMPLE 4 

Biological activity of OPG 

25 Five of the transgenic mice (animals 2,11,16,17 and 28) and 5 control littermates (animals 1,12,15,18, and 30) * 

were sacrificed for necropsy and pathological analysis using the following procedures: Prior to euthanasia, all animals 
had their identification numbers verified, then were weighed, anesthetized and blood drawn. The blood was saved as 
both serum and whole blood for a complete serum chemistry and hematology panel. Radiography was performed just 
after terminal anesthesia by lethal C02 inhalation, and prior to the gross dissection. Following this, tissues were re- 

30 moved and fixed in 10% buffered Zn-Fonmalin for histological examination. The tissues collected included the liver, 
spleen pancreas, stomach, duodenum, ileum, colon, kidney, reproductive organs, skin and mammary glands, bone 
brain heart, lung, thymus, trachea, eosphagus, thyroid, jejunem, cecum, rectum, adrenals, urinary bladder, and skeletal 
muscle Prior to fixation the whole organ weights were determined for the liver, stomach, kidney, adrenals, spleen, and 
thymus After fixation the tissues were processed into paraffin blocks, and 3 urn sections were obtained. Bone tissue 

35 was decalcified using a formic acid solution, and all sections were stained with hematoxylin and eosm. In addition, 
staining with Gomori's reticulin and Massorfs trichrome were performed on certain tissues. Enzyme histochemistry 
was performed to determine the expression of tartrate resistant acid phosphatase (TRAP), an enyzme highly expressed 
by osteoclasts, multinucleated bone-resorbing cells of monocyte-macrophage lineage. Immunohistochemistry for BrdU 
and F480 monocyte-macrophage surface antigen was also performed to detect replicating cells and cells of the mono- 

40 cyte-macrophage lineage, respectively. To detect F480 surface antigen expression, formalin fixed, paraffin embedded 
4um sections were deparaffinized and hydrated to deionized water. The sections were quenched with 3% hydrogen 
peroxide blocked with Protein Block (Lipshaw, Pittsburgh, PA), and incubated in rat monoclonal anti-mouse F480 
(Harlan Indianapolis, IN). This antibody was detected by biotinylated rabbit anti-rat immunoglobulins, peroxidase con- 
jugated'strepavidin (BioGenex San Ramon, CA) with DAB as chromagen (BioTek, Santa Barbara, CA). Sections were 

45 counterstained with hematoxylin. 

Upon gross dissection and observation of visceral tissues, no abnormalities were found in the transgene expressors 
or control littermates. Analysis of organ weight indicate that spleen size increased by approximately 38% in the trans- 
genic mice relative to controls. There was a slight enlargement of platelet size and increased circulating unstained 
cells in the transgene expressors. There was a marginal decrease in platelet levels in the transgene expressors. In 

so addition the serum uric acid, urea nitrogen, and alkaline phosphatase levels all trended lower in the transgene ex- 
pressors The expressors were found to have increased radiodensity of the skeleton, including long bones (femurs), 
vertebrae, and flat bones (pelvis). The relative size of femurs in the expressors were not different from the the control 

mice. ... 
Histological analysis of stained sections of bone from the OPG expressors show severe osteopetrosis with the 
55 presence of cartilage remnants from the primary spongiosa seen within bone trabecules in the diaphysis of the femur. 
A clearly defined cortex was not identifiable in the sections of femur. In normal animals, the central diaphysis is filled 
with bone marrow Sections of vertebra also show osteoporotic changes implying that the OPG-induced skeletal chang- 
es were systemic The residual bone marrow showed predominantly myeloid elements. Megakaryocytes were present. 
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Reticulin stains showed no evidence for reticulin deposition. I mm un ©histochemistry for F480, a cell surface antigen 
expressed by cells of monocyte-macrophage derivation in the mouse, showed the presence of F480 positive cells in 
th* marrow spaces. Focally, flattened F480 positive cells could be seen directly adjacent to trabecular bone surfaces. 
The mesenchymal cells lining the bony trabeculae were flattened and appeared inactive. Based on H&E and TRAP 

5 stains, osteoclasts were rarely found on the trabecular bone surfaces in the OPG expressors. In contrast, osteoclasts 
and/or chondroclasts were seen in the region of the growth plate resorbing cartilage, but their numbers may be reduced 
compared to controls. Also, osteoclasts were present on the cortical surface of the metaphysis where modelling activity 
is usually robust. The predominant difference between the expressors and controls was the profound decrease in 
trabecular osteoclasts, both in the vertebrae and femurs. The extent of bone accumulation was directly correlated with 

10 the level of OPG transgene mRNA detected by northern blotting of total liver RNA. 

The spleens from the OPG expressors had an increased amount of red pulp with the expansion due to increased 
hematopoiesis. All hematopoietic lineages are represented. F480 positive cells were present in both control and OPG 
expressors in the red pulp. Two of the expressors (2 and 17)had foci of extramedullary hematopoiesis within the liver 
and this is likely due to the osteopetrotic marrow 

is There were no observable abnormalities in the thymus, lymph nodes, gastrointestinal tract, pancreato-hepatobiliary 

tract, respiratory tract, reproductive system, genito-urinary system, skin, nervous system, heart and aorta, breast, skel- 
etal muscle and fat. 

EXAMPLE 5 

20 

Isolation of mouse and human OPG cDNA 

A cDNA clone corresponding to the 5" end of the mouse OPG mRNA was isolated from a mouse kidney cDNA 
library (Clontech) by PCR amplification. The oligonucleotides were derived from the rat OPG cDNA sequence and are 
25 shown below: 

5 • -ATCAAAGGCAGGGCATACTTCCTG-3 f (SEQ ID NO: 13) 
30 5 1 -GTTGCACTCCTGTTTCACGGTCTG-3 9 (SEQ ID NO: 14) 

5 f -CAAGACACCTTGAAGGGCCTGATG-3 1 (SEQ ID NO: 15) 
35 5 ■ - TAACTTTT ACAGAAGAGCATCAGC- 3 1 (SEQ ID NO: 16) 

5 ■ -AGCGCGGCCGCATGAACAAGTGGCTGTGCTGCG-3 ' (SEQ ID NO: 17) 

40 

5 ' -AGCTCTAGAGAAACAGCCCAGTGACCATTCC-3 ' (SEQ ID NO: 18) 

The partial and full-length cDNA products obtained in this process were sequenced. The full-length product was 
45 digested with Not I and Xba I, then directionally cloned into the plasmid vector pRcCMV (Invitrogen). The resulting 
plasmid was named pRcCMV-Mu-OPG. The nucleotide sequence of the cloned product was compared to the rat OPG 
cDNA sequence. Over the 1300 bp region spanning the OPG LORF, the rat and mouse DNA sequences are approx- 
imately 88% identical. The mouse cDNA sequence contained a 401 aa LORF, which was compared to the rat OPG 
protein sequence and found to be -94% identical without gaps. This indicates that the mouse cDNA sequence isolated 
50 encodes the m urine OPG protein, and that the sequence and structure has been highly conserved throughout evolution. 
The mouse OPG protein sequence contains an identical putative signal peptide at its N-terminus, and all 4 potential 
sites of N-linked glycosylation are conserved. 

A partial human OPG cDNA was cloned from a human kidney cDN A library using the following rat-specific oligo- 
nucleotides: 

55 
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5'-GTG AAG CTG TGC AAG AAC CTG ATG-3' (SEQ ID NO: 19) 
5'-ATC AAA GGC AGG GCA TAC TTC CTG-3' (SEQ ID NO: 20) 

This PCR product was sequenced and used to design primers for amplifying the 3' end of the human cDNA using 
a human OPG genomic clone in lambda as template: 

5' -TCCGTAAGAAACAGCCCAGTGACC- 3 ' (SEQ ID NO: 29) 
5' -CAGATCCTGAAGCTGCTCAGTTTG- 3 ' (SEQ ID NO: 21) 

The amplified PCR product was sequenced, and together with the 5" end sequence, was used to design 5' and 3 1 
human-specific primers useful for amplifying the entire human OPG cDNA coding sequences: 



20 5 ' -AGCGCGGCCGCGGGGACCACAATGAACAAGTTG- 3 ' (SEQ ID N O: 22) 

5' - AGCTCTAGAATTGTGAGGAAACAGCTCAATGGC- 3 ' (SEQ ID NO: 23) 

25 The full-length human PCR product was sequenced, then directionally cloned into the plasmid vector pRcCMV 

(Invitrogen) using Not I and Xba I. The resulting plasmid was named pRcCMV-human OPG, The nucleotide sequence 
of the cloned product was compared to the rat and mouse OPG cDN A sequences. Over the 1 300 bp region spanning 
the OPG LORF the rat and mouse DNA sequences are approximately 78-88% identical to the human OPG cDNA. 
The human OPG cDNA sequence also contained a 401 aa LORF, and it was compared to the rat and mouse prote.n 

30 sequences The predicted human OPG protein is approximatlely 85% identical, and -90% identical to the rat and 
mouse proteins, respectively. Sequence alignment of rat. mouse and human proteins show that they have been highly 
conserved during evolution. The human protein is predicted to have a N-terminal signal peptide, and 5 potential sites 
of N-linked glycosylation, 4 of which are conserved between the rat and mouse OPG proteins. 

The DNA and predicted amino acid sequence of mouse OPG is shown in Figure 9A and 9B (SEQ ID NO.122). 

35 The DNA and predicted amino acid sequence of human OPG is shown in Figure 9C an 9D (SEQ ID NQ124). A 
comparison of the rat, mouse and human OPG amino acid sequences is shown in Figure 9E and 9F. 

Isolation of additional human OPG cDNA clones revealed the presence of a G to C base change at position 103 
of the DNA sequence shown in Figure 9C. This nucleotide change results in substitution of an asparagme for a lysine 
at position 3 of the amino acid sequence shown in Figure 9C. The remainder of the sequence in clones having this 

40 change was identical to that in Figure 9C and 9D. 

EXAMPLE 6 

OPG three-dimensional structure modelling 

The amino-terminal portion of OPG has homology to the extracellular portion of all known members of the TNFR 
superfamily (Figure 1C). The most notable motif in this region of TNFR-related genes is an -40 amino acid, cyste.ne- 
rich repeat sequence which folds into distinct structures (Banner et al. Cell 73, 431-445 (1993)). This motif is usually 
displayed in four (range 3-6) tandem repeats (see Figure 1C), and is known to be involved in ligand binding (BeuUar 
so and van Huff el Science 264, 667-663 (1 994)). Each repeat usually contains six interspaced cysteine residues, which 
are involved in forming three intradomain disulfide bonds, termed SS1 , SS2, and SS3 (Banner et al., ibjd) In some 
receptors, such as TNFR2, CD30 and CD40, some of the repeat domains contain only two intrachain disulfide bonds 

(SS1 and SS3). . 

The human OPG protein sequence was aligned to aTNFRI extracellular domain profile using methods described 
55 by Luethy, et al., ibid, and the results were graphically displayed using the PrettyPlot program from the W.scons.n 
Package version 8 1 (Genetics Computer Group, Madison, Wl) (Figure 10). The alignment indicates a clear conser- 
vation of cysteine residues involved in formation of domains 1-4. This alignment was then used to construct a three- 
dimensional (3-D) model of the human OPG N-terminal domain using the known 3-D structure of the extracellular 



18 



BNSDOCIO: <EP 07B4O93A1_l_> 



EP 0 784 093 A1 



domain of p55 TNFR1 (Banner et al., jbid) as the template. To do this the atomic coordinates of the peptide backbone 
and side chains of identical residues were copied from the crystal structure coordinates of TNFR1. Following this, the 
remaining coordinates for the insertions and different side chains were generated using the LOOK program (Molecular 
Applications Group, Palo Alto, CA). The 3-D model was then refined by minimizing its conformational energy using 
5 LOOK. 

By analogy with other TNFR family members, it is assumed that OPG binds to a ligand. For the purpose of modelling 
the interaction of OPG with its ligand, the crystal structure of TNF-p was used to simulate a 3-D representation of an 
u OPG ligand". This data was graphically displayed (see Figure 11) using Molscript (Kraulis, J. Appl. Cryst. 24, 946-950, 
1991). A model for the OPG/ligand complex with 3 TNFp and 3 OPG molecules was constructed where the relative 

io positions of OPG are identical to TNFR1 in the crystal structure. This model was then used to find the residues of OPG 
that could interact with its ligand using the following approach: The solvent accessible area of all residues in the complex 
and one single OPG model were calculated. The residues that have different accessibility in the complex than in the 
monomer are likely to interact with the ligand. 

The human and mouse OPG amino acid sequences were realigned using this information to highlight sequences 

is comprising each of the cysteine rich domains 1-4 (Figure 12A and 12B). Each domain has individual structural char- 
acteristics which can be predicted: 

Domain 1 

20 Contains 4 cysteines involved in SS2 (C41 to C54) and SS3 (C44 to C62) disulfide bonds. Although no SS1 bond 

is evident based on disulfide bridges, the conserved tyrosine at position 28 is homologous to Y20 in TNFR1 , which is 
known to be involved in interacting with H66 to aid in domain formation. OPG has a homologous histidine at position 
75, suggesting OPG Y28 and H75 stack together in the native protein, as do the homologous residues in TNFR1 . 
Therefore, both of these residues may indeed be important for biological activity, and N-terminal OPG truncations up 

25 to and beyond Y28 may have altered activity In addition, residues E34 and K43 are predicted to interact with a bound * 
ligand based on our 3-dimensional model. 

Domain 2 

30 Contains six cysteines and is predicted to contain SS1 (C65 to C80), SS2 (C83 to C98) and SS3 (C87 to C105) 

disulfide bonds. This region of OPG also contains an region stretching from P66-Q91 which aligns to the portion of 
TNFR1 domain 2 which forms close contacts with TNFp (see above), and may interact with an OPG ligand. In particular 
residues P66, H68, Y69, Y70, T71 , D72, S73, H75, T76, S77, D78, E79, L81 , Y82, P85, V86, K88, E89, L90, and Q91 
are predicted to interact with a bound ligand based on our structural data. 

35 

Domain 3 

Contains 4 cysteines involved in SS1 (C107 to C 118) and SS3 (C 124 to C142) disulfide bonds, but not an SS2 
bond. Based on our structural data, residues E115, L118 and K119 are predicted in to interact with an OPG ligand. 

40 

Domain 4 

Contains 4 cysteines involved in SS1 (C145 to C160) and SS3 (C166 to C185) disulfide bonds, but not an SS2 
bond, similar to domain 3. Our structural data predict that E153 and S155 interact with an OPG ligand. 
45 Thus, the predicted structural model for OPG identifies a number of highly conserved residues which are likely to 

be important for its biological activity. 

EXAMPLE 7 

50 Production of recombinant secreted OPG protein in mammalian cells 

To determine if OPG is actually a secreted protein, mouse OPG cDNA was fused to the human lgG1 Fc domain 
as a tag (Capon et al. Nature 337 , 525-531 (1 989)), and expressed in human 293 fibroblasts. Fc fusions were carried 
out using the vector pFc-A3. pFc-A3 contains the region encoding the Fc portion of human immunoglobulin IgG^yl 
55 heavy chain (Ellison et al. ibid ) from the first amino acid of the hinge domain (Glu-99) to the carboxyl terminus and is 
flanked by a S'-Notl fusion site and 3'-Sall and Xbal sites. The plasmid was constructed by PCR amplification of the 
human spleen cDNA library (Clontech). PCR reactions were in a final volume of 100 uJ and employed 2 units of Vent 
DNA polymerase (New England Biolabs) in 20 mM Tris-HCI (pH 8.8), 10 mM KCI, 10 uJvl (NH 4 )2S0 4 , 2 mM MgS0 4 , 
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0 1% Triton X-100 with 400 jaM each dNTP and 1 ng of the cDNA library to be amplified together with 1 u.M of each 
primer. Reactions were initiated by denaturation at 95°C for 2 min, followed by 30 cycles of 95°C for 30 s, 55*C for 30 
s.and 73°C for 2 min. The 5' primer 

5 5' ATAGCGGCCGCTGAGCCCAAATCTTGTGACAAAACTCAC 3' (SEQ ID NO: 24) 

incorporated a Notl site immediately 5' to the first residue (Glu-99) of the hinge domain of IgG-yl. The 3' primer 
10 5 • -TCTAGAGTCGACTTATCATTTACCCGGAGACAGGGAGAGGCTCTT-3 ' (SEQ ID NO: 25) 

incorporated Sail and Xbal sites. The717-bp PCR product was digested with Notl and Sail, isolated by electrophoresis 
through 1% agarose (FMC Corp.),purified by the Geneclean procedure (BIO 101, Inc.) and cloned into Notl, Sail- 
is digested pBluescript II KS vector (Stratagene). The insert in the resulting plasmid, pFc-A3, was sequenced to confirm 
the fidelity of the PCR reaction. 

The cloned mouse cDNA in plasmid pRcCMV-MuOPG was amplified using the following two sets of primer pairs: 
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Pair 1 

5 f -CCTCTGAGCTCAAGCTTCCGAGGACCACAATGAACAAG-3 ' (SEQ ID N O: 26) 

5 • -CCTCTGCGGCCGCTAAGCAGCTTATTTTCACGGATTGAACCTG-3 9 (SEQ ID NO: 27) 



Pair 2 

5 • -CCTCTGAGCTCAAGCTTCCGAGGACCACAATGAACAAG-3 ■ (SEQ ID NO: 28) 
30 5 ' -CCTCTGCGGCCGCTGTTGCATTTCCTTTCTG— 3 1 (SEQ ID NO: 30) 

The first pair amplifies the entire OPG LORF, and creates a Notl restriction site which is compatible with the in- 
frame Not I site in Fc fusion vector pFcA3. pFcA3 was prepared by engineering a Notl restriction site 5' to aspartic acid 

35 reside 216 of the human lgG1 Fc cDNA. This construct introduces a linker which encodes two irrelevant ammo acids 
which span the junction between the OPG protein and the IgG Fc region. This product, when linked to the Fc portion, 
would encode all 401 OPG residues directly followed by all 227 amino acid residues of the human lgG1 Fc region (Fl. 
Fc) The second primer pair amplifies the DNA sequences encoding the first 180 amino acid residues of OPG, which 
encompasses its putative ligand binding domain. As above, the 3' primer creates an artificial Not I restriction site which 

40 fuses the C-terminal truncated OPG LORF at position threonine 180 directly to the lgG1 Fc domain (CT.fc). 

The amino acid sequence junction linking OPG residue 401 and aseptic acid residue 221 of the human Fc region 
can be modified as follows: The DNA encoding residues 216-220 of the human Fc region can be deleted as described 
below or the cysteine residue corresponding to C220 of the human Fc region can be mutated to either serine or alanine. 
OPF-Fc fusion protein encoded by these modifed vectors can be transfected into human 293 cells, or CHO cells, and 

45 recombinant OPG-Fc fusion protein purified as described below. 

Both products were directionally cloned into the plasmid vector pCEP4 (Invitrogen). pCEP4 contains the Epstein- 
Barr virus origin of replication, and is capable of episomal replication in 293-EBNA-1 cells. The parent pCEP4, and 
pCEP4-FI.Fc and pCEP4-CT.Fc vectors were lipofected into293-EBNA-1 cells using the manufacturer's recommended 
methods The transfected cells were then selected in 100 u.g/ml hygromycin to select for vector expression, and the 

so resulting drug- resistant mass cultures were grown to confluence. The cells were then cultured in serum-free media for 
72 hr and the conditioned media removed and analysed by SDS-PAGE. A silver staining of the polyacrylamide gel 
detects the major conditioned media proteins produced by the drug resistant 293 cultures. In the pCEP4-FI.Fc and the 
pCEP4-CTFc conditioned media, unique bands of the predicted sizes were abundantly secreted (see Figures 1 3B and 
13C) The full-length Fc fusion protein accumulated to a high concentration, indicating that it may be stable. Both Fc 

55 fusion proteins were detected by anti-human lgG1 Fc antibodies (Pierce) on western blots, indicating that they are 
recombinant OPG products. 

The full length OPG-Fc fusion protein was purified by Protein-A column chromatography (Pierce) using the man- 
ufacturers recommended procedures. The protein was then subjected to N-terminal sequence analysis by automated 
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Edman degradation as essentially described by Matsudaira et al. (J. Biol. Chem. 262 , 10-35 (1987)). The following 
amino acid sequence was read after 1 9 cycles: 



5 NH2-E TLPPKYLHYDPETGHQL L-C02H 

(SEQIDNO:31) 

This sequence was identical to the predicted mouse OPG amino acid sequence beginning at amino acid residue 
10 22, suggesting that the natural mammalian leader cleavage site is between amino acid residues Q21 -E22, not between 
Y31-D32 as originally predicted. The expression experiments performed in 293-EBNA cells with pCEP4-FLFc and 
pCEP4-CT.Fc demonstrate that OPG is a secreted protein, and may act system ically to bind its ligand. 

Procedures similar to those used to construct and express the muOPG[22-1 80]-Fc and muOPG[22-401 ]-Fc fusions 
were employed for additional mouse and human OPG-Fc fusion proteins. 
is Murine OPG cDNA encoding amino acids 1-185 fused to the Fc region of human lgG1 [muOPG Ct(185).Fc] was 

constructed as follows. Murine OPG cDNA from plasmid pRcCMV Mu Osteoprotegerin (described in Example 5) was 
amplified using the following primer pair in a polymerase chain reaction as described above: 
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30 



45 



50 



55 



1333-82: 

5'-TCC CTT GCC CTG ACC ACT CTT-3' (SEQ ID NO: 32) 
1333-80: 



5'-CCT CTG CGG CCG CAC ACA CGT TGT CAT GTG TTG C-3' 
(SEQ ID NO: 33) 



This primer pair amplifies the murine OPG cDNA region encoding amino acid residues 63-185 (corresponding to 
bp 278-645) of the OPG reading frame as shown in Figure 9A. The 3* primer contains a Not I restriction site which is 
compatible with the in-frame Not I site of the Fc fusion vector pFcA3. The product also spans a unique EcoRI restriction 

35 site located at bp 436. The amplified PCR product was purified, cleaved with Not! and EcoRI, and the resulting EcoRI- 
Notl restriction fragment was purified. The vector pCEP4 having the murine 1-401 OPG-Fc fusion insert was cleaved 
with EcoRI and Notl, purified, and ligated to the PCR product generated above. The resulting pCEP4-based expression 
vector encodes OPG residues 1-185 directly followed by all 227 amino acid residues of the human lgG1 Fc region. 
The murine OPG 1-185.Fc fusion vector was transfected into 293 cells, drug selected, and conditioned media was 

40 produced as described above. The resulting secreted murine OPG 1-185.Fc fusion product was purified by Protein-A 
column chromatography (Pierce) using the manufacturers recommended procedures. 

Murine OPG DNA encoding amino acid residues 1-194 fused to the Fc region of human lgG1 (muOPG Ct(194). 
Fc) was constructed as follows. Mouse OPG cDNA from plasmid pRcCMV Mu-Osteoprotegerin was amplified using 
the following primer pairs: 



1333-82: 

5'-TCC CTT GCC CTG ACC ACT CTT-3' (SEQ ID NO: 34) 
1333-81: 

5'-CCT CTG CGG CCG CCT TTT GCG TGG CTT CTC TGT T-3' 

(SEQ ID NO: 35) 



This primer pair amplifies the murine OPG cDNA region encoding amino acid residues 70-194 (corresponding to 
bp 298-672) of the OPG reading frame. The 3' primer contains a Not I restriction site which is compatible with the in- 
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frame Not I site of the Fc fusion vector P FcA3. The product also spans a unique EcoRI restriction site located at bp 
436 The amplified PCR product was cloned into the murine OPG[1-401] Fc fusion vector as described above. The 
resulting pCEP4-based expression vector encodes OPG residues 1 -1 94 directly followed by all 227 amino ac.d residues 
of the human lgG1 Fc region. The murine OPG 1-1 94. Fc fusion vector was transfected into 293 cells, drug selected, 
and conditioned media was produced. The resulting secreted fusion product was purified by Protein-A column chro- 
matography (Pierce) using the manufacturers recommended procedures. 

Human OPG DNA encoding amino acids 1 -401 fused to the Fc region of human lgG1 was constructed as follows. 
Human OPG DNA in plasmid pRcCMV-hu osteoprotegerin (described in Example 5) was amplified using the following 
oligonucleotide primers: 

1254-90: 

5'CCT CTG AGC TCA AGC TTG GTT TCC GGG GAC CAC AAT G-3'(SEQ ID NO: 36) 

is 1254-95: 

5'-CCT CTG CGG CCG CTA AGC AGC TTA TTT TTA CTG AAT GG-3' 

(SEQ ID NO: 37) 

20 The resulting PCR product encodes the full-length human OPG protein and creates a Not I restriction site which 

is compatible with the in-frame Not I site Fc fusion vector FcA3. The PCR product was directional V cloned into the 
plasmid vector P CEP4 as described above. The resulting expression vector encodes human OPG residues 1-401 
directly followed by 227 amino acid residues of the human lgG1 Fc region. Conditioned media from transfected and 
drug selected cells was produced and the huOPG FI.Fc fusion product was purified by Protein-A column chromatog- 

25 raphy (Pierce) using the manufacturers recommended procedures. rtrv , 

Human OPG DNA encoding amino acid residues 1-201 fused to the Fc region of human lgG1 [huOPG Ct(201V 
Fc] was constructed as follows. The cloned human OPG cDNAfrom plasmid pRrCMV-hu osteoprotegenn was amplified 
by PCR using the following oligonucleotide primer pair: 



30 



35 



40 



1254-90: 

5'-CCT CTG AGC TCA AGC TTG GTT TCC GGG GAC CAC AAT 
G-3' (SEQ ID NO: 38) 
1254-92: 

5'-CCT CTG CGG CCG CCA GGG TAA CAT CTA TTC CAC-3' 

(SEQ ID NO: 39) 



This primer pair amplifies the human OPG cDNA region encoding amino acid residues 1 -201 of the OPG reading 
frame and creates a Not I restriction site at the 3' end which is compatable with the in-frame Not I site Fc fusion vector 
FcAs'-This product, when linked to the Fc portion, encodes OPG residues 1-201 directly followed by all 221 am.no 
acid residues of the human lgG1 Fc region. The PCR product was directionally cloned into the plasmid vector pCEP4 
45 as described above. Conditioned media from transfected and drug selected cells was produced, and the hu OPG Ct 
(201 ).Fc fusion products purified by Protein-A column chromatography (Pierce) using the manufacturer's recommend- 
ed procedures. 

The following procedures were used to construct and express unfused mouse and human OPG. 
A plasmid for mammalian expression of full-length murine OPG (residues 1-401) was generated by PCR ampl.fi- 
so cation of the murine OPG cDNA insert from pRcCMV Mu-Osteoprotegerin and subcloned into the expression vector 
pDSRcc (DeClerck et. atl. J. Biol. Chem. 266, 3893 (1991)). The following oligonucleotide primers were used: 
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1295-26: 

5'-CCG AAG CTT CCA CCA TGA ACA AGT GGC TGT GCT 

GC-3' (SEQIDNO: 40) 
1295-27: 

5'-CCT CTG TCG ACT ATT ATA AGC AGC TTA TTT TCA CGG 
ATT G-3' (SEQIDNO: 41) 



The murine OPG full length reading frame was amplified by PCR as described above. The PCR product was 
purified and digested with restriction endonucleases Hind Hi and Xba I (Boehringer Mannheim, Indianapolis, IN) under 
the manufacturers recommended conditions, then ligated to Hind III and Xba I digested pDSRa. Recombinant clones 

is were detected by restriction endonuclease digestion, then sequenced to ensure no mutations were produced during 
the PCR amplification steps. 

The resulting plasmid, pDSRa-muOPG was introduced into Chinese hamster ovary (CHO) cells by calcium me- 
diated transfection (Wigler et al. Cell V[, 233 (1977)). Individual colonies were selected based upon expression of the 
dihydrofolate reductase (DHFR) gene in the plasmid vector and several clones were isolated. Expression of the murine 

20 OPG -recombinant protein was monitored by western blot analysis of CHO cell conditioned media. High expressing 
cells were selected, and OPG expression was further amplified by treatment with methotrexate as described (DeClerck 
et al:, jdid). Conditioned media from CHO cell lines was produced forf urther purification of recombinant secreted murine 
OPG protein. 

A plasmid for mammalian expression of full-length human OPG (amino acids 1-401 ) was generated by subcloning 
25 the cDNA insert in pRcCMV-hu Osteoprotegerin directly into vector pDSRa (DeClerck et al., ibid ). The pRcCMV-OPG * 
plasmid was digested to completion with Not t, blunt ended with Kl enow, then digested to completion with Xba I. Vector 
DNA was digested with Hind III, blunt ended with Klenow, then digested with Xba I, then ligated to the OPG insert. 
Recombinant plasmids were then sequenced to confirm proper orientation of the human OPG cDNA. 

The resulting plasmid pDSRot-huOPG was introduced into Chinese hamster ovary (CHO) cells as described above. 
30 Individual colonies were selected based upon expression of the dihydrofolate reductase (DHFR) gene in the plasmid 
vector and several clones were isolated. Expression of the human OPG recombinant protein was monitored by western 
blot analysis of CHO cell conditioned media. High expressing clones were selected, and OPG expression was further 
amplified by treatment with methotrexate. Conditioned media from CHO cell lines expressing human OPG was pro- 
duced for protein purification. 

35 Expression vectors for murine OPG encoding residues 1-185 were constructed as follows. Murine OPG cDNA 

from pRcCMV-Mu OPG was amplified using the following oligonucleotide primers: 



1333-82: 

5'-TCC CTT GCC CTG ACC ACT CTT-3' (SEQ ID NO: 42) 
1356-12: 

5'-CCT CTG TCG ACT TAA CAC ACG TTG TCA TGT GTT 

GC-3' (SEQ ID NO: 43) 

This primer pair amplifies the murine OPG cDNA region encoding amino acids 63-185 of the OPG reading frame 
(bp 278-645) and contains an artificial stop codon directly after the cysteine codon (C185), which is followed by an 

50 artificial Sal I restriction endonuclease site. The predicted product contains an internal Eco Rl restriction site useful for 
subcloning into a pre-existing vector. After PCR amplification, the resulting purified product was cleaved with Eco Rl 
and Sal I restriction endonucleases, and the large fragment was gel purified. The purified product was then subcloned 
into the large restriction fragment of an Eco Rl and Sal I digest of pBluescript-muOPG FLFc described above. The 
resulting plasmid was digested with Hind III and Xho I and the small fragment was gel purified. This fragment, which 

55 contains a open reading frame encoding residues 1-185 was then subcloned into a Hind Ml and Xho I digest of the 
expression vector pCEP4. The resulting vector, pmuOPG [1 -1 85], encodes a truncated OPG polypeptide which termi- 
nates at a cysteine residue located at position 185. Conditioned media from transfected and drug selected cells was 
produced as described above. 
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1333-82: 

5'-TCC CTT GCC CTG ACC ACT CTT-3' (SEQ ID NO: 44) 



1356-13: 

5'-CCT CTG TCG ACT TAC TTT TGC GTG GCT TCT CTG 

TT-3' (SEQ ID NO: 45) 



This primer pair amplifies the murine OPG cDNA region encoding amino acids 70-194 of the OPG reading frame 
(bp 298-672) and contains an artificial stop codon directly after the lysine codon (K1 94), wh ich is followed by an artificial 
Sal I restriction endonuclease site. The predicted product contains an internal Eco Rl restriction site useful for sub- 
cloning into a pre-existing vector. After PCR amplification, the resulting purified product was cleaved with Eco Rl and 
Sal I restriction endonucleases, and the large fragment was gel purified. The purified product was then subcloned into 
the large restriction fragment of an Eco Rl and Sal I digest of pBluescript-muOPG FI.Fc described above. The resulting 
plasmid was digested with Hind III and Xho I and the small fragment was gel purified. This fragment, which contains 
a open reading frame encoding residues 1-185 was then subcloned into a Hind III and Xho I digest of the expression 
vector pCEP4. The resulting vector, pmuOPG [1-185], encodes a truncated OPG polypeptide which terminates at a 
lysine at position 1 94. Conditioned media from transf ected and drug selected cells was produced as described above. 

Several mutations were generated at the 5' end of the huOPG [22-401 ]-Fc gene that introduce either amino acid 
substitutions, or deletions, of OPG between residues 22 through 32. All mutations were generated with the "Quick- 
Change™ Site-Directed Mutagenesis Kit" (Stratagene, San Diego, CA) using the manfacturer's recommended condi-* 
tions. Briefly, reaction mix containing huOPG [22-401 ]-Fc plasmid DNA template and mutagenic primers were treated 
with Pfu polymerase in the presence of deoxynucleotides, then amplified in a thermocycler as described above. An 
aliqout of the reaction is then transfected into competent E. coli XL1-Blue by heatshock, then plated. Plasmid DNA 
from transformants was then sequenced to verify mutations. 

The following primer pairs were used to delete residues 22-26 of the human OPG gene, resulting in the production 
of a huOPG [27-401 ]-Fc fusion protein: 



1436-11: 

5'-TGG ACC ACC CAG AAG TAC CTT CAT TAT GAC-3'(SEQ ID NO: 140) 
1436-12: 

S'-GTC ATA ATG AAG GTA CTT CTG GGT GGT CCA-3' (SEQ ID NO: 141) 

The following primer pairs were used to delete residues 22-28 of the human OPG gene, resulting in the production 
of a huOPG [29-401]-Fc fusion protein: 



1436-17: 

5'-GGA CCA CCC AGO TTC ATT ATG ACG AAG AAA C-3'(SEQ ID NO: 142) 
1436-18: 

5'-GTT TCT TCG TCA TAA TGA AGC TGG GTG GTC C-3' (SEQ ID NO: 143) 



The following primer pairs were used to delete residues 22-31 of the human OPG gene, resulting in the production 
of a huOPG [32-401]-Fc fusion protein: 
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1436-27: 

5'-GTG GAC CAC CCA GGA CGA AGA AAC CTC TC-3' (SEQ ID NO: 144) 
5 1436-28: 

5' -GAG AGG TTT CTT CGT CCT GGG TGG TCC AC-3' (SEQ ID NO: 145) 

The following primer pairs were used to change the codon for tyrosine residue 28 to phenylalanine of the human 
10 OPG gene, resulting in the production of a huOPG [22-401 ]-Fc Y28F fusion protein: 

1436-29: 

5' -CGT TTC CTC CAA AGT TCC TTC ATT ATG AC-3' (SEQ ID NO: 146) 

75 

1436-30: 

5'-GTC ATA ATG AAG GAA CTT TGG AGG AAA CG-3' (SEQ ID NO: 147) 

20 The following: primer pairs were used to change the codon for proline residue 26 to alanine of the human OPG 

gene, resulting in the production of a huOPG [22-401 ]-Fc P26A fusion protein: 

1429-83: 

25 5' -GGA AAC GTT TCC TGC AAA GTA CCT TCA TTA TG-3 (SEQ ID NO: 148) * 

1429-84: 

5' -CAT AAT GAA GGT ACT TTG CAG GAA ACG TTT CC-3'(SEQ ID NO: 149) 

30 

Each resulting rnuOPG [22-401 ]-Fc plasmid containing the appropriate mutation was then transfected into human 
293 cells, the mutant OPG-Fc fusion protein purifiedf rom conditioned media as described above. The biological activity 
of each protein was assessed the in vitro osteoclast forming assay described in Example 11 . 

35 EXAMPLE 8 

Expression of OPG in E. coli 

A. Bacterial Expression Vectors 

40 

PAMG21 

The expression plasmid pAMG21 can be derived from the Amgen expression vector pCFM1656 (ATCC #69576) 
which in turn be derived from the Amgen expression vector system described in US Patent No. 4 : 71 0,473. The 
45 pCFM1656 plasmid can be derived from the described pCFM836 plasmid (Patent No. 4,710,473) by: (a) destroying 
the two endogenous Ndel restriction sites by end filling with T4 polymerase enzyme followed by blunt end ligation; (b) 
replacing the DNA sequence between the unique Aatll and Clal restriction sites containing the synthetic P L promoter 
with a similar fragment obtained from pCFM636 (patent No. 4,710,473) containing the PL promoter 

so 
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5' ~ CTAATTCCGCTCTCACCTACCAAACAATGCCCCCCTGCAAAAAATAAATTCATAT- 
3 ' TGCAGATTAAGGCGAGAGTGGATGGTTTGTTACGGGGGGACGTTTTTTATTTAAGTATA- 



-AAAAAACATACAGATAACCATCTGCGGTGATAAATTATCTCTGGCGGTGTTGACATAAA- 
-TTTTTTGTATGTCTATTGGTAGACGCCACTATTTAATAGAGACCGCCACAACTGTATTT- 

10 -TACCACTGGCGGTGATACTGAGCACAT 3' (SEQ ID NO: 53) 

- ATGGTGACCGCCACTATGACTCGTGTAGC5 ' (SEQ ID NO: 54) 

Clal 

is and then (c) substituting the small DN A sequence between the unique Clal and Kpnl restriction sites with the following 
oligonucleotide: 



5 ' CGATTTGATTCTAGAAGGAGGAATAACATATGGTTAACGCGTTGGAATTCGGTAC 3 ' (SEQ ID N O: 48) 

20 3 ' TAAACTAAGATCTTCCTCCTTATTGTATACCAATTGCGCAACCTTAAGC 5 ' (SEQ ID N O: 49) 

Clal Kpnl 



25 The expression plasmid pAMG21 can then be derived from pCFM1 656 by making aseries of site directed base changes 
by PCR overlapping oligo mutagenesis and DNA sequence substitutions. Starting with the Bglll site (plasmid bp # 1 80) 
immediately 5' to the plasmid replication promoter PcopB and proceeding toward the plasmid replication genes, the 
base pair changes are as follows: 
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PAMG21 frp_f bp iD PCFM1656 bp changed to in pAfclg2J 



5 



15 



# 204 


T/A 


C/G 


# 428 


A/T 


G/C 


# 509 


G/C 


A/T 


# 617 






# 679 


G/C 


T/A 


# 980 


T/A 


C/G 


# 994 


G/C 


A/T 


# 1004 


A/T 


C/G 


# 1007 


C/G 


T/A 


# 1028 


A/T 


T/A 


# 1047 


C/G 


T /A 


♦ 1178 


G/C 


T/A 


# 1466 


G/C* 


T/A 


# 2028 


G/C 


up acietion 


# 2187 


p /fi 


T / 


# 2480 


A/T 


T/A 


# 2499-2502 


AGTG 


GTCA 




TCAC 


CAGT 


# 2642 


TCCGAGC 


7 bp deletion 




AGGCTCG 




# 3435 


G/C 


A/T 


# 3446 


G/C 


A/T 


# 3643 


A/T 


T/A 



30 

The DNA sequence between the unique Aatll (position #4364 in pCFM1656) and Sacll (position #4585 in pCFM1656) 
restriction sites is substituted with the following DNA sequence: 

35 
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[Xatll sticky end] 5« GCGTAACGTATGCATGGTCTCC- 

(position #4358 in pAMG21) 3' TGCACGCATTGCATACGTACCAGAGG- 

-CCATGCGAGAGTAGGGAACTGCCAGGCATCAAATAAAACGAAAGGCTCAGTCGAAAGACT- 
-GGTACGCTCTCATCCCTTGACGGTCCGTAGTTTATTTTGCTTTCCGAGTCAGCTTTCTGA- 

-GGGCCTTTCGTTTTATCTGTTGTTTGTCGGTGAACGCTCTCCTGAGTAGGACAAATCCGC- 
-CCCGGAAAGCAAAATAGACAACAAACAGCCACTTGCGAGAGGACTCATCCTGTTTAGGCG- 

-CGGGAGCGGATTTGAACGTTGCGAAGCAACGGCCCGGAGGGTGGCGGGCAGGACGCCCGC- 
-GCCCTCGCCTAAACTTGCAACGCTTCGTTGCCGGGCCTCCCACCGCCCGTCCTGCGGGCG- 

-CATAAACTGCCAGGCATCAAATTAAGCAGAAGGCCATCCTGACGGATGGCCTTTTTGCGT- 
-GTATTTGACGGTCCGTAGTTTAATTCGTCTTCCGGTAGGACTGCCTACCGGAAAAACGCA- 

Aatll 

-TTCTACAAACTCTTTTGTTTATTTTTCTAAATACATTCAAATATGGACGTCGTACTTAAC- 
-AAGATGTTTGAGAAAACAAATAAAAAGATTTATGTAAGTTTATACCTGCAGCATGAATTG- 

20 -TTTTAAAGTATGGGCAATCAATTGCTCCTGTTAAAATTGCTTTAGAAATACTTTGGCAGC- 
-AAAATTTCATACCCGTTAGTTAACGAGGACAATTTTAACGAAATCTTTATGAAACCGTCG- 

-GGTTTGTTGTATTGAGTTTCATTTGCGCATTGGTTAAATGGAAAGTGACCGTGCGCTTAC^ 
-CCAAACAACATAACTCAAAGTAAACGCGTAACCAATTTACCTTTCACTGGCACGCGAATG- 



10 



15 



25 



35 



40 



45 



50 



55 



-TACAGCCTAATATTTTTGAAATATCCCAAGAGCTTTTTCCTTCGCATGCCCACGCTAAAC- 
-ATGTCGGATTATAAAAACTTTATAGGGTTCTCGAAAAAGGAAGCGTACGGGTGCGATTTG- 



-ATTCTTTTTCTCTTTTGGTTAAATCGTTGTTTGATTTATTATTTGCTATATTTATTTTTC- 
3Q - TAAGAAAAAGAGAAAACC AATTT AGC AAC AAACT AAAT AAT AAACGAT AT AAAT AAAAAG - 
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-GATAATTATCAACTAGAGAAGGAACAATTAATGGTATGTTCATACACGCATGTAAAAATA- 
-CTATTAATAGTTGATCTCTTCCTTGTTAATTACCATACAAGTATGTGCGTACATTTTTAT- 

-AACTATCTATATAGTTGTCTTTCTCTGAATGTGCAAAACTAAGCATTCCGAAGCCATTAT- 
-TTGATAGATATATCAACAGAAAGAGACTTACACGTTTTGATTCGTAAGGCTTCGGTAATA- 

-TAGCAGTATGAATAGGGAAACTAAACCCAGTGATAAGACCTGATGATTTCGCTTCTTTAA- 
-ATCGTCATACTTATCCCTTTGATTTGGGTCACTATTCTGGACTACTAAAGCGAAGAAATT- 

-TTACATTTGGAGATTTTTTATTTACAGCATTGTTTTCAAATATATTCCAATTAATCGGTG- 
-AATGTAAACCTCTAAAAAATAAATGTCGTAACAAAAGTTTATATAAGGTTAATTAGCCAC- 

-AATGATTGGAGTTAGAATAATCTACTATAGGATCATATTTTATTAAATTAGCGTCATCAT- 
-TTACTAACCTCAATCTTATTAGATGATATCCTAGTATAAAATAATTTAATCGCAGTAGTA- 

-AATATTGCCTCCATTTTTTAGGGTAATTATCCAGAATTGAAATATCAGATTTAACCATAG- 
-TTATAACGGAGGTAAAAAATCCCATTAATAGGTCTTAACTTTATAGTCTAAATTGGTATC- 

-AATGAGGATAAATGATCGCGAGTAAATAATATTCACAATGTACCATTTTAGTCATATCAG- 
-TTACTCCTATTTACTAGCGCTCATTTATTATAAGTGTTACATGGTAAAATCAGTATAGTC- 

-ATAAGCATTGATTAATATCATTATTGCTTCTACAGGCTTTAATTTTATTAATTATTCTGT- 
-TATTCGTAACTAATTATAGTAATAACGAAGATGTCCGAAATTAAAATAATTAATAAGACA- 

-AAGTGTCGTCGGCATTTATGTCTTTCATACCCATCTCTTTATCCTTACCTATTGTTTGTC- 
- T TC AC AGC AGCCGTAAAT AC AGAAAGT ATGGGT AGAG AAAT AGG AATGG AT AAC AAAC AG- 

-GCAAGTTTTGCGTGTTATATATCATTAAAACGGTAATAGATTGACATTTGATTCTAATAA- 
-CGTTCAAAACGCACAATATATAGTAATTTTGCCATTATCTAACTGTAAACTAAGATTATT- 

-ATTGGATTTTTGTCACACTATTATATCGCTTGAAATACAATTGTTTAACATAAGTACCTG- 
-TAACCTAAAAACAGTGTGATAATATAGCGAACTTTATGTTAACAAATTGTATTCATGGAC- 

-TAGGATCGTACAGGTTTACGCAAGAAAATGGTTTGTTATAGTCGATTAATCGATTTGATT- 
-ATCC T AGC ATGTCCAAATGCGTTCTTTTACCAAACAATATCAGCTAATTAGCT AAAC T AA- 

'CTAGATTTGTTTTAACTAATTAAAGGAGGAATAACATATGGTTAACGCGTTGGAATTCGA- 
-GATCTAAACAAAATTGATTAATTTCCTCCTTATTGTATACCAATTGCGCAACCTTAAGCT- 

SacII 

-GCTCACTAGTGTCGACCTGCAGGGTACCATGGAAGCTTACTCGAGGATCCGCGGAAAGAA- 
-CGAGTGATCACAGCTGGACGTCCCATGGTACCTTCGAATGAGCTCCTAGGCGCCTTTCTT- 

-GAAGAAGAAGAAGAAAGCCCGAAAGGAAGCTGAGTTGGCTGCTGCCACCGCTGAGCAATA- 
-CTTCTTCTTCTTCTTTCGGGCTTTCCTTCGACTCAACCGACGACGGTGGCGACTCGTTAT- 

-ACTAGCATAACCCCTTGGGGCCTCTAAACGGGTCTTGAGGGGTTTTTTGCTGAAAGGAGG- 
-TGATCGTATTGGGGAACCCCGGAGATTTGCCCAGAACTCCCCAAAAAACGACTTTCCTCC- 

-AACCGCTCTTCACGCTCTTCACGC 3' [SacII sticky end] (SEQ ID NO: 46' 



During the ligation of the sticky ends of this substitution DNA sequence, the outside Aatll and SacII sites are destroyed. 
There are unique Aatll and SacII sites in the substituted DNA. 



-TTGGCGAGAAGTGCGAGAAGTG 
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pAMG22-His 

The expression plasmid pAMG22-His can be derived from the Amgen expression vector pAMG22 by substituting 
the small DNA sequence between the unique Ndel ( #4795) and EcoRI ( #4818) restriction sites of pAMG22 with the 
s following oligonucleotide duplex: 

Ndel NhftI EcoRI 

5 ' TATGAAACATCATCACCATCACCATCATGCTAGCGTTAACGCGTTGG 3 • (SEQ ID NO: 51) 

10 3 • ACTTTGTAGTAGTGGTAGTGGTAGTACGATCGCAATTGCGCAACCTTAA 5 ' (SEQ ID NO: 52) 

MetLysHisHisHisHisHisHisHisAlaSerValAsnAlaLeuGlu i (SEQ ID NO: 108) 

is pAMG22 

The expression plasmid pAMG22 can be derived from the Amgen expression vector pCFM1656 (ATCC #69576) 
which in turn be derived from the Amgen expression vector system described in US Patent No. 4,710,473 granted 
December 1, 1987. The pCFM1656 plasmid can be derived from the described pCFM836 plasmid (Patent No. 
20 4,710,473) by: (a) destroying the two endogenous Ndel restriction sites by end filling with T4 polymerase enzyme 
followed by blunt end ligation; (b) replacing the DNA sequence between the unique Aatll and Clal restriction sites 
containing the synthetic PL promoter with a similar fragment obtained from pCFM636 (patent No. 4,710,473) containing 
the PL promoter 

25 

Aatll 

5 ' CT AAT TCCGCTCTC ACCT ACC AAAC AATGCCCCCCTGC AAAAAAT AAAT TCAT AT — 

3 ' TGCAGATTAAGGCGAGAGTGGATGGTTTGTTACGGGGGGACGTTTTTTATTT AAGTATA- 

30 



35 

-AAAAAAGATACAGATAACCATCTGCGGTGATAAATTATCTCTGGCGGTGTTGACATAAA- 
-TTTTTTGTATGTCTATTGGTAGACGCCACTATTTAATAGAGACCGCCACAACTGTATTT- 

40 -T ACC ACTGGCGGTG AT ACTGAGC AC AT 3' (SEQ ID NO: 53) 

-ATGGTGACCGCCACTATGACTCGTGTAGCS' (SEQ ID NO: 54) 

Clal 



45 



SO 



55 



and then (c) substituting the small DNA sequence between the unique Clal and Kpnl restriction sites with the following 
oligonucleotide: 



5 ' CGATTTGATTCTAGAAGGAGGAAT AACATATGGTTAACGCGTTGGAATTCGGTAC 3' (SEQ ID NO: 55) 
3 ' TAAACTAAGATCTTCCTCCTTATTGTATACCAATTGCGCAACCTTAAGC 5 ' (SEQ ID NO: 56) 

Clal Kpnl 

The expression plasmid pAMG22 can then be derived from pCFM1 656 by making a series of site directed base changes 
by PCR overlapping oligo mutagenesis and DNA sequence substitutions. Starting with the Bglll site (plasmid bp # 1 80) 
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immediately 5' to the plasmid replication promoter PcopB and proceeding toward the plasmid replication genes, the 
base pair changes are as follows: 



bp changed to in 

PAMG22. bp # bp in PCFM1656 



io # 204 T/A C/G 

# 428 A/T G/C 

# 509 G/C A/T 

# 617 - - insert two G/C 



15 



bp 



# 679 


G/C 


T/A 


# 980 


T/A 


C/G 


# 994 


G/C 


A/T 


# 1004 


A/T 


C/G 


# 1007 


C/G 


T/A 


# 1028 


A/T 


T/A 


# 1047 


C/G 


T/A 


# 1178 


G/C 


T/A 


# 1466 


G/C 


T/A 


# 2028 


G/C 


bp deletion 


# 2187 


C/G 


T/A . 


# 2480 


A/T 


T/A 


# 2499-2502 


AGTG 


GTCA 




TCAC 


CAGT 


# 2642 . 


TCCGAGC 


7 bp deletion 




AGGCTCG 




# 3435 


G/C 


A/T 


# 3446 


G/C 


A/T 


# 3643 


A/T 


T/A 



The DNA sequence between the unique Aatll (position #4364 in pCFM1656) and Sacll (position #4585 in pCFM1656) 
45 restriction sites is substituted with the following DNA sequence: 



so 



55 
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[Aatll sticky end] (position #4358 in pAMG22) 

5 ' GCGTAACGTATGCATGGTCTCCCCATGCGAGAGTAGGGAACTGCCAGGCATCAA- 
3 1 T GC AC GC ATT GC AT ACGTACC AG AGGGGT AC GCTCTCATCCCTTG ACGGTCCGTAGTT- 

-ATAAAACGAAAGGCTCAGTCGAAAGACTGGGCCTTTCGTTTTATCTGTTGTTTGTCGGTG- 
-TATTTTGCTTTCCGAGTCAGCTTTCTGACCCGGAAAGCAAAATAGACAACAAACAGCCAC- 

-AACGCTCTCCTGAGTAGGACAAATCCGCCGGGAGCGGATTTGAACGTTGCGAAGCAACGG- 
-TTGCGAGAGGACTCATCCTGTTTAGGCGGCCCTCGCCTAAACTTGCAACGCTTCGTTGCC- 

^CCCGGAGGGTGGCGGGCAGGACGCCCGCCATAAACTGCCAGGCATCAAATTAAGCAGAAG- 
-GGGCCTCCCACCGCCCGTCCTGCGGGCGGTATTTGACGGTCCGTAGTTTAATTCGTCTTC- 

-GCCATCCTGACGGATGGCCTTTTTGCGTTTCTACAAACTCTTTTGTTTATTTTTCTAAAT- 
-CGGTAGGACTGCCTACCGGAAAAACGCAAAGATGTTTGAGAAAACAAATAAAAAGATTTA- 

Aatll 

-ACATTCAAATATGGACGTCTCATAATTTTTAAAAAATTCATTTGACAAATGCTAAAATTC- 



-TGTAAGTTTATACCTGCAGAGTATTAAAAATTTTTTAAGTAAACTGTTTACGATTTTAAG- 

^TTGATTAATATTCTCAATTGTGAGCGCTCACAATTTATCGATTTGATTCTAGATTTGTTT- 
-AACTAATTATAAGAGTTAACACTCGCGAGTGTTAAATAGCTAAACTAAGATCTAAACTCA- 

-TAACTAATTAAAGGAGGAATAACATATGGTTAACGCGTTGGAATTCGAGCTCACTAGTGT- 
-ATTGATTAATTTCCTCCTTATTGTATACCAATTGCGCAACCTTAAGCTCGAGTGATCACA- 

SacII 

-CGACCTGCAGGGTACCATGGAAGCTTACTCGAGGATCCGCGGAAAGAAGAAGAAGAAGAA- 
•GCTGGACGTCCCATGGTACCTTCGAATGAGCTCCTAGGCGCCTTTCTTCTTCTTCTTCTT- 

-GAAAGCCCGAAAGGAAGCTGAGTTGGCTGCTGCCACCGCTGAGCAATAACTAGCATAACC- 
-CTTTCGGGCTTTCCTTCGACTCAACCGACGACGGTGGCGACTCGTTATTGATCGTATTGG- 

-CCT TGGGGCCTCT AAACGGGTCTTG AGGGGT TTTTTGCTGAAAGG AGGAACCGCTCTTC A— 
-GGAACCCCGGAGATTTGCCCAGAACTCCCCAAAAAACGACTTTCCTCCTTGGCGAGAAGT- 

-CGCTCTTCACGC 3 ■ (SEQ ID NO: 58) 
-GCGAGAAGTG 5 "(SEQ ID NO: 57) 

[SacII sticky end] (position #5024 in pAMG22) 

During the ligation of the sticky ends of this substitution DNA sequence, the outside Aatll and SacII sites are destroyed. 
There are unique Aatll and SacII sites in the substituted DNA. 

B. Human OPG Metf32-4011 

In the example, the expression vector used was pAMG21 , a derivative of pCFM1 656 (ATCC accession no. 69576) 
which contains appropriate restriction sites for insertion of genes downstream from the ]ux PR promoter. (See U.S. 
Patent No. 5,169,318 for description of the lux expression system). The host cell used was GM120 (ATCC accession 
no. 55764). This host has the lacIQ promoter and lad gene integrated into a second site in the host chromosome of a 
prototrophic E. cojj K12 host. Other commonly used E^ coli expression vectors and host cells are also suitable for 
expression. 

A DNA sequence coding for an N-terminal methionine and amino acids 32-401 of the human OPG polypeptide 
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was placed under control of the luxPR promoter in the plasmid expression vector pAMG21 as follows. To accomplish 
this, PCR using oligonucleotides #1257-20 and #1257-19 as primers was performed using as a template plasmid 
pRcCMV-Hu OPG DN A containing the human OPG cDNA and thermocycling for 30 cycles with each cycle being: 94°C 
for 20 seconds, followed by 37°C for 30 seconds, followed by 72°C for 30 seconds. The resulting PCR sample was 

5 resolved on an agarose gel, the PCR product was excised, purified, and restricted with Kpni and BamHI restriction 
endonucleases and purified. Synthetic oligonucleotides #1 257-21 and #1 257-22 were phophorylated individually using 
T4 polynucleotide kinase and ATP, and were then mixed together heated at 94°C and allowed to slow cool to room 
temperature to form an oligonucleotide linker duplex containing Ndel and Kpnl sticky ends. The phosphorylated linker 
duplex formed between oligonucleotides #1 257-21 and #1 257-22 containing Ndel and Kpnl cohesive ends (see Figure 

10 1 4A) and the Kpnl and BamHI digested and purified PCR product generated using oligo primers #1 257-20 and #1 257-1 9 
(see above) was directionally inserted between two sites of the plasmid vector pAMG21, namely the Ndel site and 
BamHI site, using standard recombinant DNA methodology (see Figure 14A and sequences below). The synthetic 
linker utilized E. coli codons and provided for a N-terminal methionine. 

Two clones were selected and plasmid DNA isolated, and the human OPG insert was subsequently DNA sequence 

is confirmed. The resulting pAMG21 plasmid containing amino acids 32-401 of the human OPG polypeptide immediately 
preceded in frame by a methionine is referred to as pAMG21-huOPG met[32-401] or pAMG21-huOPG met[ 32-401]. 

01igo#1257-19 

20 5 t -TACGCACTGGATCCTTATAAGCAGCTTATTTTTACTGATTGGAC-3 1 

(SEQIDNO:59) 

25 Oligo#1257-20 

5 f -GTCCTCCTGGTACCTACCTAAAACAAC-3 1 

(SEQ ID NO: 60) 

30 

01igo#1257-21 

5 ' -TATGGATGAAGAAACTTCTCATCAGCTGCTGTGTGATAAATGTCC 
GCCGGGTAC -3 1 (SEQ ID NO: 61) 

35 



01igo#1257-22 

40 5 ' -CCGGCGGACATTTATCACACAGCAGCTGATGAGAAGTTTCTTCATCCA-3 » 

(SEQ ID NO: 47) 

Cultures of pAMG21-huOPG met[32-401] in E. coli GM120 in 2XYT media containing 20 u.g/ml kanamycin were 
incubated at 30° C prior to induction. Induction of huOPG met[32-401] gene product expression from the luxPR promoter 

45 was achieved following the addition of the synthetic autoinducer N-{3-oxohexanoyl)-DL-homoserine lactone to the 
culture media to a final concentration of 30 ng/ml and cultures were incubated at either 30°C or 37°C for a further 6 
hours. After 6 hours, the bacteria! cultures were examined by microscopy for the presence of inclusion bodies and 
were then pelletted by centrifugation. Retractile inclusion bodies were observed in induced cultures indicating that 
some of the recombinant huOPG met[32-401] gene product was produced insolubly in E. colj. Some bacterial pellets 

50 were resuspended in 1 0mM Tris-HCI/pH8, 1 mM EDTA and lysed directly by addition of 2X Laemlli sample buffer to 1 X 
final, and p-mercaptoethanol to 5% final concentration, and analyzed by SDS-PAGE. A substantially more intense 
coomassie stained band of approximately 42kDa was observed on a SDS-PAGE gel containing total cell lysates of 
30°C and 37°C induced cultures versus lane 2 which is a total cell lysate of a 30°C uninduced culture (Figure 14B). 
The expected gene product would be 370 amino acids in length and have an expected molecular* weight of about 42.2 

55 kDa. Following induction at 37°C for 6 hours, an additional culture was pelleted and either processed for isolation of 
inclusion bodies (see below) or processed by microfluidizing. The pellet processed for microfluidizing was resuspended 
in 25mM Tris-HCI/pH8, 0.5M NaCI buffer and passed 20 times through a Microfluidizer Model 11 08 (Microfluidics Corp.) 
and collected. An aliquot was removed of the collected sample (microfluidized total lysate), and the remainder was 
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pelleted at 20 000 x g for 20 minutes. The supernatant following centrifugation was removed (m.crofluid.zed so lub e 
f raction) and the pellet resuspended in a 25mM Tns-HCI/pH8, 0.5M NaCI. 6M urea solution (microflu.d.zed insoluble 
fraction . To an aliquot of either the total soluble, or insoluble fraction was added to an equa ydume of _2X La emaU 
sample buffer and p-mercaptoethanol to 5% final concentration. The samples were then analyzed by SDS-PAGE. A 
sfgnitanramount of recombinant huOPG met[32-401] gene product appeared to be found in the .nsoluble fraction. 
To purify the recombinant protein inclusion bodies were purified as follows: Bacterial cells were separated from media 
by density gradient centrifugation in a Beckman J-6B centrifuge equipped with a JS-4.2 rotor at 4, 900 x g for « 
at 4'C The bacterial pellet was resuspended in 5 ml of water and then diluted to a f.nal volume of 10 ml with water. 
This suspension was transferred to a stainless steel cup cooled in ice and subjected to sonic disruption using a Branson 
Sonifier equipped with a standard tip (power setting^, duty cycle=95%, 80 bursts). The sonicated cell suspension 
was centrfuged in a Beckman Optima TLX ultracentrifuge equipped with a TLA 100.3 rotor a 195,000 x g for tc M0 
minutes at 23°C The supernatant was discarded and the pellet rinsed with a stream of water from a squirt bottle. The 
pellets were collected by scraping with a micro spatula and transferred to a glass homogenizer (15 ml capacrty^ Five 
ml of Percoll solution (75% liquid Percoll, 0.15 M sodium chloride) was added to the homogen.zer and the contents 
1 homogenized until uniformly suspended. The volume was increased to 19.5 ml by the addition o Percoll Ma 
mixed and distributed into 3 Beckman Quick-Seal tubes (1 3 x 32 mm). Tubes were sealed according to manuf acturers 
Auctions. The tubes were spun in a Beckman TLA 1 00.3 rotor at 23°C, 20,000 rpm (21 ,600 x g), 30 minutes. Jhe 
ubes were examined for the appropriate banding pattern. To recover the retractile bodies grad.ent frac K>ns we e 
recovered and pooled, then diluted with water. The inclusion bodies were pelleted by centrrfugation, and the protein 
20 concentration estimated following SDS-PAGE. h ,*»<«»h« 
An aliquot of inclusion bodies isolated as described below was dissolved .nto 1X Laemll. sample buffer wrth 5/o 
B-mercaptoethanol and resolved on a SDS-PAGE gel and the isolated inclusion bodies provide a highly pur.f.ed ^re- 
combinant huOPG[32-401 ] gene product. The major -42 kDa band observed after resolving .nclus.on bod.es on a SDS- 
polyacn/lamide gel was excised from a separate gel and the N-terminal amino acid sequence determined essenfa ly 
25 as described (Matsudaira et al. J. Biol. Chem. 262. 1 0-35 (1 987)). The following sequence was determined after 1 9 
cycles: 
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NH 2 -MDEETSHQLLCDKCPPGTY-COOH (SEQIDNO:62) 

This sequence was found to be identical to the first 19 amino acids encoded by the pAMG21 Hu-OPG met[32-401] 
expression vector, produced by a methionine residue provided by the bacterial expression vector. 

C. Human OPG metr22-4011 

A DNA sequence coding for an N-terminal methionine and amino acids 22 through 401 of human OPG was placed 
under control of the luxPR promoter in a prokaryotic plasmid expression vector pAMG21 as fo lows. Isolated p asm.d 
DNA of P AMG21-huOPG met[32-401] (see Section B) was cleaved with Kpnl and BamHI restr.ct.on endonucleases 
and the Lult^g fragments were resolved on an agarose gel. The B fragment (-1 064 bp fragment) was .solved f rom 
the gel using standard methodology. Synthetic oligonucleotides (oligos) #1267-06 and #1267-07 were phosphorylated 
individually and allowed to form an oligo linker duplex, which contained Ndel and Kpnl cohes.ve ends, us.no , methods 
described in Section B. The synthetic linker duplex utilized E, coH codons and prov.ded for an N-1erm.nal meth.on.ne 
The phosphorylated oligo linker containing Ndel and Kpnl cohesive ends and the isolated -1064 bp fragmen : of 
DAMG21-huOP met[32-401] digested with Kpnl and BamHI restriction endonucleases were directionally inserted be- 
tween me Ndel and BamHI sites of pAMG21 using standard recombinant DNA methodology. The ligation mixture was 
transformed into E. col. host 393 by electroporation utilizing the manufacturer's protocol. Clones were selected plasm.d 
DNA was isolated^ DNA sequencing was performed to verify the DNA sequence of the huOPG-met[22-401] gene. 

Oligo #1267-06 
5 * -TAT GGA AAC TTT TCC TCC AAA ATA TCT TCA TTA TGA TGA 
AGA AAC TTC TCA TCA GCT GCT GTG TGA TAA ATG TCC GCC GGG 
TAC-3' (SEQIDNO-.63) 
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Oligo #1267-07 
5'-CCG GCG GAC ATT TAT CAC ACA GCA GCT GAT GAG AAG TTT 
5 CTT CAT CAT AAT GAA GAT ATT TTG GAG GAA AAG TTT CCA- 3 1 

(SEQIDNO:64) 

Cultures of pAMG21-huOPG-met[22-401] in E. coli host 393 were placed in 2XYT media containing 20 u.g/ml 
10 kanamycin and were incubated at 30°C prior to induction. Induction of recombinant gene product expression from the 
luxPR promoter of vector pAMG21 was achieved following the addition of the synthetic autoinducer N-(3-oxohexanoyl)- 
DL-homoserine lactone to the culture media to a final concentration of 30 ng/ml and incubation at either 30°C or 37°C 
for a further 6 hours. After 6 hours, bacterial cultures were pelleted by centrifugation (=30° C I+6 or 37°C I+6). Bacterial 
cultures were also either pelleted just prior to induction (=30°C Prel) or alternatively no autoinducer was added to a 
is separate culture which was allowed to incubate at 30°C for a further 6 hours to give an uninduced (Ul) culture (=30°C 
Ul). Bacterial pellets of either 30° C Prel, 30° C Ul, 30°C I +6, or37°C I+6 cultures were resuspended, lysed : and analyzed 
by SDS-polyacrylamide gel electrophoresis (PAGE) as described in Section B. Polyacrylamide gels were either stained 
with coomassie blue and/or Western transferred to nitrocellulose and immunoprobed with rabbit anti-mu OPG-Fc pol- 
yclonal antibody as described in Example 10. The level of gene product following induction compared to either an 
20 uninduced (30°C Ul) or p re-induction (30° C Prel) sample. 

D. Murine OPG metf22-4011 

A DNA sequence coding for an N-terminal methionine and amino acids 22 through 401 of the murine (mu) OPG 
25 (OPG) polypeptide was placed under control of the luxPR promoter in a prokaryotic plasmid expression vector pAMG21 ' 
as follows. PCR was performed using oligonucleotides #1 257-1 6 and #1 257-1 5 as primers, plasmid pRcCMV-Mu OPG 
DNA as a template and thermocycling conditions as described in Section B. The PCR product was purified and cleaved 
with Kpnl and BamHI restriction endonucleases as described in Section B. Synthetic oligos #1260-61 and #1260-82 
were phosphorylated individually and allowed to form an oligo linker duplex with Ndel and Kpnl cohesive ends using 
30 methods described in Section B. The synthetic linker duplex utilized ^E. coli codons and provided for an N-terminal 
methionine. The phosphorylated linker duplex formed between oligos #1260-61 and #1260-82 containing Ndel. and 
Kpnl cohesive ends and the Kpnl and BamHI digested and purified PCR product generated using oligo primers 
#1257-16 and #1257-15 were directionally inserted between the Ndel and BamHI sites of pAMG21 using standard 
methodology. The ligation mixture was transformed into E. coli host 393 by electroporation utilizing the manufacturer's 
3S protocol. Clones were selected, plasmid DNA was isolated, and DNA sequencing was performed to verify the DNA 
sequence of the MuOPG met[22-401] gene. • 

Expression of recombinant muOPG met[22-401] polypeptide from cultures of 393 cells harboring plasmid 
pAMG21 -MuOPG met[22-401] following induction was determined using methods described in Section C. 

40 

Oligo #1257-15 

5'-TAC GCA CTG GAT CCT TAT AAG CAG CTT ATT TTC ACG 
GAT TGA AC-3 ■ (SEQ ID NO: 65) 

45 



Oligo #1257-16 

5'-GTG CTC CTG GTA CCT ACC TAA AAC AGC ACT GCA CAG 
TG-S'CSEQ ID NO: 66) 



55 
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Oligo #1260-61 

5 '-TAT GGA AAC TCT GCC TCC AAA ATA CCT GCA TTA CGA 
TCC GGA AAC TGG TCA TCA GCT GCT GTG TGA TAA ATG TGC TCC 
GGG TAC-3 1 (SEQ ID NO: 67) 



10 Oligo #1260-82 

5 f -CCG GAG CAC ATT TAT CAC ACA GCA GCT GAT GAC CAG 
TTT CCG GAT CGT AAT GCA GGT ATT TTG GAG GCA GAG TTT 
« CCA- 3 1 (SEQ ID NO: 68) 

E. Murine OPG metf32-401] 

20 A DNA sequence coding tor an N-terminal methionine and amino acids 32 through 401 of murine OPG was placed 

under control of the luxPR promoter in a prokaryotic plasmid expression vector pAMG21 as follows. To accomplish 
this, Synthetic oligos #1267-08 and #1267-09 were phosphorylated individually and allowed to form an oligo linker 
duplex using methods described in Section B. The synthetic linker duplex utilized E. coli codons and provided for an 
N-terminal methionine. The phosphorylated linker duplex formed between oligos #1267-08 and #1267-09 containing ^ 

25 Ndel and Kpnl cohesive ends, and the Kpnl and BamHI digested and purified PCR product described earlier (see * 
Section D), was directionally inserted between the Ndel and BamHI sites of pAMG21 using standard methodology. 
The ligation mixture was transformed into E. coli host 393 by electropo ration utilizing the manufacturer's protocol. 
Clones were selected, plasmid DNA was isolated, and DNA sequencing was performed to verify the DNA sequence 
of the muOPG-met[32-401] gene. 

30 Expression of recombinant muOPG-met [32-401] polypeptide from cultures of 393 cells harboring the pAMG21 

recombinant plasmid following induction was determined using methods described in Section C. 

Oligo #1267-08 

5'-TAT GGA CCC AGA AAC TGG TCA TCA GCT GCT GTG TGA 
TAA ATG TGC TCC GGG TAC-3 ' (SEQ ID NO: 69) 

40 

Oligo #1267-09 

5»-CCG GAG CAC ATT TAT CAC ACA GCA GCT GAT GAC CAG 
TTT CTG GGT CCA- 3 1 (SEQ ID NO: 70) 



F. Murine OPG met-lvs[22-401l 

so A DNA sequence coding for an N-terminal methionine followed by a lysine residue and amino acids 22 through 

401 of murine OPG was placed under control of the lux PR promoter in prokaryotic expression vector pAMG21 as 
follows. Synthetic oligos #1282-95 and #1282-96 were phosphorylated individually and allowed to form an oligo linker 
duplex using methods described in Section B. The synthetic linker duplex utilized E. coli codons and provided for an 
N-terminal methionine. The phosphorylated linker duplex formed between oligos #1282-95 and #1282-96 containing 

ss Ndel and Kpnl cohesive ends and the Kpnl and BamHI digested and purified PCR product described in Section D was 
directionally inserted between the Ndel and BamHI sites in pAMG21 using standard methodology. The ligation mixture 
was transformed into E. coli host 393 by electroporation utilizing the manufacturer's protocol. Clones were selected, 
plasmid DNA was isolated, and DNA sequencing was performed to verify the DNA sequence of the MuOPG— Met-Lys 
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[22-401] gene. 

Expression of recombinant MuOPG Met-Lys[22-401] polypeptide from transformed 393 cells harboring the recom- 
binant pAMG21 plasmid following induction was determined using methods described in Section C. 

5 

Oligo #1282-95 

5' -TAT GAA AGA AAC TCT GCC TCC AAA ATA CCT GCA TTA 
CGA TCC GGA AAC TGG TCA TCA GCT GCT GTG TGA TAA ATG TGC 

10 

TCC GGG TAC- 3' (SEQ ID NO: 71) 

75 Oligo #1282-96 

5 f -CCG GAG CAC ATT TAT CAC ACA GCA GCT GAT GAC CAG 
TTT CCG GAT CGT AAT GCA GGT ATT TTG GAG GCA GAG TTT CTT 
20 TCA-3 ■ (SEQ ID NO: 72) 

G. Murine OPG met-1ys-(his) 7 r22-4011 

25 A DNA sequence coding for N-terminal residues Met-Lys-His-His-His-His-His-His-His (=MKH) followed by amino * 

acids 22 through 401 of Murine OPG was placed under control of the lux PR promoter in prokaryotic expression vector 
pAMG21 as follows. PCR was performed using oligonucleotides #1300-50 and #1257-15 as primers and plasmid 
pAMG21-muOPG-met[22-401] DNA as template. Thermocycling conditions were as described in Section B. The re- 
sulting PCR sample was resolved on an agarose gel, the PCR product was excised, purified, cleaved with Ndel and 

30 BamHI restriction endonucleases and purified. The Ndel and BamHI digested and purified PCR product generated 
using oligo primers #1300-50 and #1257-15 was directionally inserted between the Ndel and BamHI sites of pAMG21 
using standard DNA methodology. The ligation mixture was transformed into EL coli host 393 by electroporation utilizing 
the manufacturer's protocol. Clones were selected, plasmid DNA was isolated, and DNA sequencing performed to 
verify the DNA sequence of the muOPG-MKH [22-401] gene. 

35 Expression of recombinant MuOPG-MKH [22-401] polypeptide from transformed 393 cultures harboring the re- 

combinant pAMG21 plasmid following induction was determined using methods described in Section C. 



40 Oligo #1300-50 

5 ' -GTT CTC CTC ATA TGA AAC ATC ATC ACC ATC ACC ATC 

ATG AAA CTC TGC CTC CAA AAT ACC TGC ATT ACG AT-3 ■ 
(SEQ ID NO: 73) 

45 

Oligo #1257-15 
(see Section D) 

50 

H. Murine OPG met- lysf 22-40 11 (h\s) 7 

A DNA sequence coding for a N-terminal met-lys, amino acids 22 through 401 murine OPG, and seven histidine 
residues following amino acid 401 (=muOPG MK[22-401]-H 7 ), was placed under control of the lux PR promoter in 
55 prokaryotic expression vector pAMG21 as follows. PCR was performed using oligonucleotides #1 300-49 and #1 300-51 
as primers and pAMG21-muOPG met[22-401] DNA as template. Thermocycling conditions were as described in Sec- 
tion B. The resulting PCR sample was resolved on an agarose gel, the PCR product was excised, purified, restricted 
with Ndel and BamHI restriction endonucleases, and purified. The Ndel and BamHI digested and purified PCR product 
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was directionally inserted between the Ndel and BamHI sites in pAMG21 using standard methodology. The ligation 
was transformed into E. ccjj host 393 by elect roporation utilizing the manufacturer's protocol. Clones were selected, 
plasmid DNA was isolated, and DNA sequencing was performed to verify the DNA sequence of the muOPG MK 
[22-401 ]-H7 gene. 

Expression of the recombinant muOPG MK-[22-401]-H 7 polypeptide from a transformed 393 cells harboring the 
recombinant pAMG21 plasmid following induction was determined using methods described in Section C. 



Oligo #1300-49 

10 * 

S'-GTT CTC CTC ATA TGA AAG AAA CTC TGC CTC CAA AAT 

ACC TGC A-3» (SEQIDNO:74) 



15 



20 



Oligo #1300-51 

5'-TAC GCA CTG GAT CCT TAA TGA TGG TGA TGG TGA TGA 
TGT AAG CAG CTT ATT TTC ACG GAT TGA ACC TGA TTC CCT A- 

3' (SEQ ID NO: 75) 

I, Murine OPG metr27-4Q11 

25 A DNA sequence coding for a N-terminal methionine and amino acids 27 through 401 of murine OPG was placed * 

under control of the lux PR promoter of prokaryotic expression vector pAMG21 as follows. PGR was performed with 
oligonucleotides #1309-74 and #1257-15 as primers and plasmid pAMG21-muOPG-met[22-401] DNA as template. 
- - Thermocycling conditions were as described in Section B. The resulting PCR sample was resolved on an agarose gel, 
the PCR product was excised, purified, cleaved with Ndel and BamHI restriction endonucleases, and purified. The 

30 Ndel and BamHI digested and purified PCR product was directionally inserted between the Ndei and BamHI sites of 
pAMG21 using standard methodology. The ligation mixture was transformed into E. coli host 393 by elect roporation 
utilizing the manufacturer's protocol. Clones were selected, plasmid DNA was isolated, and DNA sequencing was 
performed to verify the DNA sequence of the muOPG-met[27-401 ] gene. 

Expression of recombinant muOPG-met[27-401] polypeptide from a transfected 393 culture harboring the recom- 

35 binant pAMG21 plasmid following induction was determined using methods described in Section C. 

Oligo#1309-74 

40 5'-GTT CTC CTC ATA TGA AAT ACC TGC ATT ACG ATC CGG 

AAA CTG GTC AT- 3 1 (SEQ ID NO: 76) 



45 



Oligo#1257-15 
(See Section D) 



so J. Human OPG metf27-40n 

A DNA sequence coding for a N-terminal methionine and amino acids 27 through 401 of human OPG was placed 
under control of the lux PR promoter of prokaryotic expression vector pAMG21 as follows. PCR was performed using 
oligonucleotides #1309-75 and #1309-76 as primers and plasmid P AMG21-huOPG-met[22-401] DNA as template. 
ss Thermocycling conditions were as described in Section B. The resulting PCR sample was resolved on an agarose gel, 
the PCR product was excised, purified, restricted with Asel and BamHI restriction endonucleases, and purified. The 
Asel and BamHI digested and purified PCR product above was directionally inserted between the Ndel and BamHI 
sites of pAMG21 using standard methodology. The ligation mixture was transformed into coli host 393 by electro- 
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poration utilizing the manufacturer's protocol. Clones were selected, plasmid DNA was isolated, and DNA sequencing 
was performed to verify the DNA sequence of the huOPG-met[27-401] gene. 

Expression of the recombinant huOPG-met[27-401] polypeptide following induction of from transfected 393 cells 
harboring the recombinant pAMG21 piasmid was determined using methods described in Section C. 

5 

Oligo #1309-75 

5'-GTT CTC CTA TTA ATG AAA TAT CTT CAT TAT GAT GAA 
10 GAA ACT T-3 ' (SEQ ID NO: 77) 

Oligo #1309-76 

5'-TAC GCA CTG GAT CCT TAT AAG CAG CTT ATT TTT ACT 
GAT T-3' (SEQ ID NO: 78) 

20 

K. Murine OPG metr22-180l 

A DNA sequence coding for a N-terminal methionine and amino acids 22 through 180 of murine OPG was placed 
under control of the lux PR promoter of prokaryotic expression vector pAMG21 as follows. PCR was performed with 

25 oligonucleotides #1309-72 and #1309-73 as primers and plasmid pAMG21-muOPG-met[22-401] DNA as template. * 
Thermocycling conditions were as described in Section B. The resulting PCR sample was resolved on an agarose gel, 
the PCR product was excised, purified, restricted with Ndel and BamHI restriction endonucleases, and purified. The 
Ndel and BamHI digested and purified PCR product above was directionally inserted between the Ndel and BamHI 
sites of pAMG21 using standard methodology. The ligation was transformed into E. coli host 393 by elect roporation 

30 utilizing the manufacturer's protocol. Clones were selected, plasmid DNA was isolated, and DNA sequencing was 
performed to verify the DNA sequence of the muOPG-met[22-180] gene. 

Expression of recombinant muOPG-met[22-180] polypeptide from transformed 393 cultures harboring the recom- 
binant pAMG21 plasmid following induction was determined using methods described in Section C. 

35 

Oligo #1309-72 

5'-GTT CTC CTC ATA TGG AAA CTC TGC CTC CAA AAT ACC 
TGC A-3' (SEQ ID NO: 79) 

40 

Oligo #1309-73 

45 5 f -TAC GCA CTG GAT CCT TAT GTT GCA TTT CCT TTC TGA 

ATT AGC A-3 ' (SEQ ID NO: 80) 

L. Murine OPG metf27-1801 . 

50 

A DNA sequence coding for a N-terminal methionine and amino acids 27 through 1 80 of murine OPG was placed 
under the control of the lux PR promoter of prokaryotic expression vector pAMG21 as follows. PCR was performed 
using oligonucleotides #1309-74 (see Section I) and #1309-73 (see Section K) as primers and plasmid 
pAMG21-muOPG met[22-401] DNA as template. Thermocycling conditions were as described in Section B. The re- 
ss suiting PCR sample was resolved on an agarose gel, the PCR product excised, purified, restricted with Ndel and BamHI 
restriction endonucleases, and purified. The Ndel and BamHI digested and purified PCR product above was direction- 
ally inserted between the Ndel and BamHI sites in pAMG21 using standard methodology The ligation mixture was 
transformed into E. coli host 393 by electroporation utilizing the manufacturer's protocol. Clones were selected, plasmid 
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DNA was isolated, and DNA sequencing was performed to verify the DNA sequence of the muOPG met[27-1 80] gene. 

Expression of recombinant muOPG met[27-1 80] polypeptide from cultures of transformed 393 cells harboring the 
recombinant pAMG21 plasmid following induction was determined using methods described in Section C. 

,s M. Murine OPG met[22-1891 and met[22-194l 

A DNA sequence coding for a N-terminal methionine and either amino acids 22 through 189, or 22 through 194 
of murine OPG was placed under control of the lux PR promoter of prokaryotic expression vector pAMG21 as follows. 
The pair of synthetic oligonucleotides #1337-92 and #1337-93 (=muOPG-189 linker) or #1333-57 and #1333-58 

10 (=muOPG-1 94 linker) were phosphorylated individually and allowed to form an oligo linker duplex pair using methods 
described in Section B. Purified plasmid DNA of pAMG21-muOPG-met[22-401] was cleaved with Kpnl and BspEI 
restriction endonucleases and the resulting DNA fragments were resolved on an agarose gel. The -41 3 bp B fragment 
was isolated using standard recombinant DNA methodology The phosphorylated oligo linker duplexes formed between 
either oligos #1 337-92 and #1 337-93 (muOPG-1 89 linker) or oligos #1 333-57 and #1 333-58 (muOPG-1 94 linker) con- 

15 taining BspEI and BamHl cohesive ends, and the isolated -413 bp B fragment of plasmid pAMG21-muOPG-met 
[22-401] digested with Kpnl and BspEI restriction endonucleases above, was directionally inserted between the Kpnl 
and BamHl sites of pAMG21 -muOPG met[22-401 ] using standard methodology. Each ligation mixture was transformed 
into E. coli host 393 by electroporation utilizing the manufacturer's protocol. Clones were selected, plasmid DNA was 
isolated, and DNA sequencing was performed to verify the DNA sequence of either the muOPG-met[22-189] or 

20 muOPG-met[22-1 94] genes. 

Expression of recombinant muOPG-met[22-189] and muOPG-met[22-1 94] polypeptides from recombinant 
pAMG21 plasmids transformed into 393 cells was determined using methods described in Section C. 

25 Oligo #1337-92 

S'-CCG GAA ACA GAT AAT GAG- 3 1 (SEQTDNO:81) 

30 

Oligo #1337-33 

5 » -GAT CCT CAT TAT CTG TTT-3 * (SEQ ID NO: 82) 

35 

Oligo #1333-57 

5 f -CCG GAA ACA GAG AAG CCA CGC AAA AGT AAG-3' 

(SEQ ID NO: 83) 

40 

Oligo #1333-58 

5 • -GAT CCT TAC TTT TGC GTG GCT TCT CTG TTT-3' 

(SEQ ID NO: 84) 

N. Murine OPG met[27-1891 and metf27-194l 

so A DNA sequence coding for a N-terminal methionine and either amino acids 27 through 189, or 27 through 194 

of murine OPG was placed under control of the lux PR promoter of prokaryotic expression vector pAMG21 as follows. 
Phosphorylated oligo linkers either n muOPG-189 linker 1 * or n muOPG-1 94 linker" (see Section M) containing BspEI and 
BamHl cohesive ends, and the isolated -413 bp B fragment of plasmid pAMG21-muOPG-met[22-401] digested with 
Kpnl and BspEI restriction endonucleases were directionally inserted between the Kpnl and BamHl sites of plasmid 

55 pAMG21-muOPG-met[27-401] using standard methodology. Each ligation was transformed into E. coli host 393 by 
electroporation utilizing the manufacturer's protocol. Clones were selected, plasmid DNA was isolated, and DNA se- 
quencing was performed to verify the DNA sequence of either the muOPG met[27-189] ormuOPG met[27-194] genes. 
Expression of recombinant muOPG met[27-1 89] and muOPG met[27-1 94] following induction of 393 cells harbor- 
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ing recombinant pAMG21 plasmids was determined using methods described in Section C. 
O. Human OPG metf22-185l. metf22-1891, metr22-1941 

s A DNA sequence coding for a N-terminal methionine and either amino acids 22 through 185, 22 through 189, or 

22 through 1 94 of the human OPG polypeptide was placed under control of the lux PR promoter of prokaryotic expres- 
sion vector pAMG21 as follows. The pair of synthetic oligonucleotides #1 331 -87 and #1 331 -88 (=huOPG-1 85 linker), 
#1331-89 and #1331-90 (=huOPG-189 linker), or #1331-91 & #1331-92 (=huOPG-194 linker) were phosphorylated 
individually and each allowed to form an oligo linker duplex pair using methods described in Section B. Purified plasmid 

io DNA of pAMG21-huOPG-met[27-401] was restricted with Kpnl and Ndel restriction endonucleases and the resulting 
DNA fragments were resolved on an agarose gel. The -407 bp B fragment was isolated using standard recombinant 
DNA methodology. The phophorylated oligo linker duplexes formed between either oligos #1331-87 and #1331-88 
(huOPG-185 linker), oligos #1331-89 and #1331-90 (huOPG-189 linker), or oligos #1331-91 and #1331-92 (huOPG- 
194 linker)[each linker contains Ndel and BamHI cohesive ends], and the isolated -407 bp B fragment of plasmid 

is pAMG21-huOPG-met[27-401] digested with Kpnl and Ndel restriction endonucleases above, was directionally inserted 
between the Kpnl and BamHI sites of plasmid pAMG21 -huOPG-met[22-401 ] using standard methodology. Each ligation 
was transformed into E. coli host 393 by electroporation utilizing the manufacturer's protocol. Clones were selected, 
plasmid DNA was isolated, and DNA sequencing was performed to verify the DNA sequence of either the huOPG-met 
[22-185], huOPG-met[22-189], or huOPG-met[22-194] genes. 

20 Expression of recombinant huOPG-met[22-1 85], huOPG-met[22-1 89] or huOPG-met[22-1 94] in transformed 393 

cells harboring recombinant pAMG21 plasmids following induction was determined using methods described in Section 
C. 



25 



30 



35 



40 



45 



Oligo #1331-87 

5' -TAT GTT AAT GAG-3 1 (SEQ ID NO: 85) 
Oligo #1331-88 

5 1 -GAT CCT CAT TAA CA-3 ' (SEQ ID NO: 86) 



Oligo #1331-89 

5'-TAT GTT CCG GAA ACA GTT AAG- 3 ■ (SEQ ID NO: 87) 



Oligo #1331-90 

5 1 -GAT CCT TAA CTG TTT CCG GAA CA-3 ' (SEQ ID NO: 88) 



50 Oligo #1331-91 



5 ■ -TAT GTT CCG GAA ACA GTG AAT CAA CTC AAA AAT AAG- 

(SEQIDNO:89) 
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Oligo #1331-92 

5 '-GAT CCT TAT TTT TGA GTT GAT TCA CTG TTT CCG GAA 
* CA-3 ' (SEQIDNO:90) 

P. Human OPG met!27-1851, met f27-1891. met [27-1941 

10 A DNA sequence coding for a N-terminal methionine and either amino acids 27 through 185, 27 through 189, or 

27 throuqh 1 94 of the human OPG polypeptide was placed under control of the lux PR promoter of prokaryotic expres- 
sion vector pAMG21 as follows. Phosphorylated oligo linkers "huOPG-185 linker, "huOPG-189 linker", or "huOPG- 
194 linker" (See Section O) each containing Ndel and BamHI cohesive ends, and the isolated -407 bp B fragment of 
Dlasmid P AMG21-huOPG-met[27-401] digested with Kpnl and Ndel restriction endonucleases (See Section O) were 

is directionally inserted between the Kpnl and BamHI sites of plasmid P AMG21-huOPG-met[27-401] (See Section J) 
using standard methodology. Each ligation was transformed into E. coli host 393 by electroporation utilizing the man- 
ufacturer's protocol Clones were selected, plasmid DNA isolated, and DNA sequencing performed to verify the DNA 
sequence of either the huOPG-met[27-185], huOPG-met[27-189], or huOPG-met[27-194] genes. 

Expression of recombinant huOPG-met[27-l 85], huOPG-met[27-189], andhuOPG-met[27-194]1rom recombinant 

20 P AMG21 plasmids transformed into 393 cells was determined using methods described in Section C. 

O. Murine OPG metf27-401l (P33E. G36S. A45P) 

A DNA sequence coding for an N-terminal methionine and amino acids 27 through 48 of human OPG followed by ^ 
25 amino acid residues 49 through 401 of murine OPG was placed under control of the lux PR promoter of prokaryotic 
expression vector pAMG21 as follows. Purified plasmid DNA of P AMG21-huOPG-met[27-401] (See Section J) was 
cleaved with Aatll and Kpnl restriction endonucleases and a -1075 bp B fragment isolated from an agarose gel using 
standard recombinant DNA methodology. Additionally, plasmid P AMG21-muOPG-met[22-401] DNA (See Section D) 
was digested with Kpnl and BamHI restriction endonucleases and the -1064 bp B fragment isolated as described 
30 above The isolated -1075 bp P AMG21-huOPG-met[27-401] restriction fragment containing Aatll & Kpnl cohesive ends 
(see above) the -1064 bp pAMG21 -muOPG-met[22-401] restriction fragment containing Kpnl and BamHI sticky ends 
and a -5043 bp restriction fragment containing Aatll and BamHI cohesive ends and corresponding to the nucleic acid 
sequence of pAMG21 between Aatll & BamHI were ligated using standard recombinant DNA methodology. The ligation 
was transformed into E. coM host 393 by electroporation utilizing the manufacturer's protocol. Clones were selected 
35 and the presence of the recombinant insert in the plasmid verified using standard DNA methodology. muOPG-27-401 
(P33E G36S A45P)gene Amino acid changes in muOPG from proline-33 to glutamic acid-33, glycine-36 to serme- 
36, and alanine-45to proline-45, result from replacement of muOPG residues 27 through 48 with huOPG residues 27 

thr ° Expression of recombinant muOPG-met[27-401] (P33E, G36S, A45P) from transformed 393 cells harboring the 
40 recombinant pAMG21 plasmid was determined using methods described in Section C. 

R. Murine OPG met-lys-(his) z -ala-ser-(aspyivsf22-40 1l (A45T) 

A DNA sequence coding lor an N-terminal His tag and enterokinase recognition sequence which is (NH 2 to COOH 
45 terminus) : Met-Lys-His-His-His-His-His-His-His-Ala-Ser-Asp-Asp-Asp-Asp-Lys (=HEK), followed by amino acids 22 
through 401 of the murine OPG polypeptide was placed under control of the Jac repressor regulated Ps4 promoter as 
tollows pAMG22-His (See Section A) was digested with Nhel and BamHI restriction endonucleases, and the large 
fragment (the A fragment) isolated from an agarose gel using standard recombinant DNA methodology. Oligonucle- 
otides #1 282-91 and #1 282-92 were phosphorylated individually and allowed to form an oligo linker duplex using meth- 
50 ods previously described (See Section B). The phosphorylated linker duplex formed between oligos #1282-91 and 
#1282-92 containing Nhel and Kpnl cohesive ends, the Kpnl and BamHI digested and purified PCR product described 
(see Section D) and the At ragment of vector pAMG22-His digested with Nhel and BamHI were ligated using standard 
recombinant DNA methodology. The ligation was transformed into E coli host GM1 20 by electroporation utilizing the 
manufacturer's protocol. Clones were selected, plasmid DNA isolated and DNA sequencing performed to verify the 
ss DNA sequence of the muOPG-HEK[22-401] gene. DNA sequencing revealed a spurious mutation in the natural muOPG 
sequence that resulted in a single amino acid change of Alanine-45 of muOPG polypeptide to a Threonine. 

Expression of recombinant muOPG-HEK[22-401] (A45T) from GM120 cells harboring the recombinant pAMG21 
plasmid was determined using methods similar to those described in Section C, except instead of addition of the 
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synthetic autoinducer, IPTG was added to 0.4 mM final to achieve induction. 

Oligo #1282-91 

5 ' -CTA GCG ACG ACG ACG ACA AAG AAA CTC TGC CTC CAA 
AAT ACC TGC ATT ACG ATC CGG AAA CTG GTC ATC AGC TGC TGT 
GTG ATA AAT GTG CTC CGG GTA C-3' (SEQIDNO:91) 

10 - 

Oligo #1282-92 

5 '-CCG GAG CAC ATT TAT CAC ACA GCA GCT GAT GAC CAG 
TTT CCG GAT CGT AAT GCA GGT ATT TTG GAG GCA GAG TTT CTT 
TGT CGT CGT CGT CG-3 '(SEQ ID NO: 92) 

20 S. Human OPG met-arq-qly-ser-(hts)J22-401l 

Eight oligonucleotides (1338-09 to 1338-16 shown below) were designed to produce a 175 base fragment as 
overlapping, double stranded DNA. The oligos were annealed, ligated, and the 5' and 3' oligos were used as PCR 
primers to produce large quantities of the 175 base fragment. The final PCR gene products were digested with restriction 

25 endonucleases Clal and Kpnl to yield a fragment which replaces the N-terminal 28 codons of human OPG. The Clal * 
and Kpnl digested PCR product was inserted into pAMG21-huOPG [27-401] which had also been cleaved with Clal 
and Kpnl. Ligated DNA was transformed into competent host cells of E. coli strain 393. Clones were screened for the 
ability to produce the recombinant protein product and to possess the gene fusion having the correct nucleotide se- 
quence. Protein expression levels were determined from 50 ml shaker flask studies. Whole cell lysate and sonic pellet 

30 were analyzed for expression of the construct by Coomassie stained PAGE gels and Western analysis with murine •-. 
antiOPG antibody. Expression of huOPG Met-Arg-Gly-Ser-(His) 6 [22-401] resulting in the formation of large inclusion 
bodies and the protein was localized to the insoluble (pellet) fraction. 

35 

1338-0? 

ACA AAC ACA ATC GAT TTG ATA CTA GA (SEQ ID NO: 93) 

40 

1338-10 

TTT GTT TTA ACT AAT TAA AGG AGG AAT AAA ATA TGA GAG GAT CGC ATC AC 

45 (SEQ ID NO: 94) 

1338-11 

CAT CAC CAT CAC GAA ACC TTC CCG CCG AAA TAC CTG CAC TAC GAC GAA GA 

so 

(SEQ ID NO: 95) 



1338-12 

AAC CTC CCA CCA GCT GCT GTG CGA CAA ATG CCC GCC GGG TAC CCA AAC A 

(SEQ ID NO: 96) 
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1338-13 

TGT TTG GGT ACC CGG CGG GCA TTT GT (SEQIDNO:97) 

5 

1338-14 

to CGC ACA GCA GCT GGT GGG AGG TTT CTT CGT CGT AGT GCA GGT ATT TCG GC 

(SEQ ID NO: 98) 

1S 1338-15 

GGG AAG GTT TCG TGA TGG TGA TGG TGA TGC GAT CCT CTC ATA TTT TAT T 

(SEQ ID NO: 99) . 

20 

1338-16 

CCT CCT TTA ATT AGT TAA AAC AAA TCT AGT ATC AAA TCG ATT GTG TTT GT 

(SEQ ID NO: 100) 

25 

T. Human OPG meMvsP22-401 1 and met(lvsW22-4011 

To construct the met-lys and met-(lys)3 versions of human OPG[22-401], overlapping oligonucleotides were de- 
signed to add the appropriate number of lysine residues. The two oligos for each construct were designed to overlap, 
allowing two rounds of PCR to produce the final product. The template for the first PCR reaction was a plasmid DNA 
preparation containing the human OPG 22-401 gene. The first PCR added the lysine residue(s). The second PCR 
used the product of the first round and added sequence back to the first restriction site, Clal. 

The final PCR gene products were digested with restriction endonucleases Clal and Kpnl, which replace the N- 
terminal 28 codons of hu OPG, and then ligated into plasmid pAMG21 -hu OPG [27-401] which had been also digested 
with the two restriction endonucleases. Ligated DNA was transformed into competent host cells of E. coli strain 393. 
Clones were screened for the ability to produce the recombinant protein product and to possess the gene fusion having 
the correct nucleotide sequence. Protein expression levels were determined from 50 ml shaker flask studies. Whole 
cell lysate and sonic pellet were analyzed for expression of the construct by Coomassie stained PAGE gels and Western 
analysis with murine anti-OPG antibody. Neither construct had a detectable level of protein expression and inclusion 
bodies were not visible. The DNA sequences were confirmed by DNA sequencing. 
Oligonucleotide primers to prepare Met-Lys huOPG[22-401]: 

1338-17 

ACA AAC ACA ATC GAT TTG ATA CTA GAT TTG TTT TAA CTA ATT 
AAA GGA GGA ATA AAA TG (SEQ ID NO: 101) 

50 

1338-18 

CTA ATT AAA GGA GGA ATA AAA TGA AAG AAA CTT TTC CTC CAA 

AAT ATC (SEQ ID NO: 102) 

55 
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1338-20 

TGT TTG GGT ACC CGG CGG ACA TTT ATC ACA C (SEQ ID NO: 103) 

5 

Oligonucleotide primers to prepare Met-(Lys) 3 -huOPG [22-401 ]: 

10 1338-17 

ACA AAC ACA ATC GAT TTG ATA CTA GAT TTG TTT TAA CTA ATT 
AAA GGA GGA ATA AAA TG (SEQ ID NO: 104) 

15 

1338-19 

CTA ATT AAA GGA GGA ATA AAA TGA AAA AAA AAG AAA CTT TTC 

20 

CTC CAA AAT ATC (SEQ ID NO: 105) 



25 1338-20 

TGT TTG GGT ACC CGG CGG ACA TTT ATC ACA C (SEQ ID NO: 106) 

30 U, Human and Murine QPG [22-401 1/Fc Fusions 

Four OPG-Fc fusions were constructed where the Fc region of human lgG1 was fused at the N-terminus of either 
human or murine Osteoprotegerin amino acids 22 to 401 (referred to as Fc/OPG [22-401 ]) or at the C-terminus (referred 
to as OPG[22-401]/Fc). Fc fusions were constructed using the fusion vector pFc-A3 described in Example 7. 

35 All fusion genes were constructed using standard PCR technology. Template for PCR reactions were plasmid 

preparations containing the target genes. Overlapping oligos were designed to combine the C-terminal portion of one 
gene with the N terminal portion of the other gene: This process allows fusing the two genes together in the correct 
reading frame after the appropriate PCR reactions have been performed: Initially one "fusion" oligofpr each gene was 
put into a PCR reaction with a universal primer for the vector carrying the target gene. The complimentary "fusion" 

40 oligo was used with a universal primer to PCR the other gene. At the end of this first PCR reaction, two separate 
products were obtained, with each individual gene having the fusion site present, creating enough overlap to drive the 
second round of PCR and create the desired fusion. In the second round of PCR, the first two PCR products were 
combined along with universal primers and via the overlapping regions, the full length fusion DNA sequence was 
produced. 

45 The final PCR gene products were digested with restriction endonucleases Xbal and BamH!, and then ligated into 

the vector pAMG21 having been also digested with the two restriction endonucleases. Ligated DNA was transformed 
into competent host cells of E. coli strain 393. Clones were screened for the ability to produce the recombinant protein 
product and to possess the gene fusion having the correct nucleotide sequence. Protein expression levels were de- 
termined from 50 ml shaker flask studies. Whole cell lysate, sonic pellet, and supernatant were analyzed for expression 

so of the fusion by Coomassie stained PAGE gels and Western analysis with murine anti-OPG antibody. 

Fc/huQPG [22-4011 

Expression of the Fc/hu OPG [22-401] fusion peptide was detected on a Coomassie stained PAGE gel and on a 
55 Western blot. The cells have very large inclusion bodies, and the majority of the product is in the insoluble (pellet) 
fraction. The following primers were used to construct this OPG-Fc fusion: 



45 
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1318-48 

CAG CCC GGG TAA AAT GGA AAC GTT TCC TCC AAA ATA TCT TCA 
TT (SEQIDNO: 107) 



10 1318-49 

CGT TTC CAT TTT ACC CGG GCT GAG CGA GAG GCT CTT CTG CGT 

GT (SEQIDNO: 108) 



15 



25 



30 



S5 



Fc/muOPG [22-4011 



Expression of the fusion peptide was detected on a Coomassie stained gel and on a Western blot. The cells have 
very large inclusion bodies, and the majority of the product is in the insoluble (pellet) fraction. The following primers 
20 were used to construct this OPG-Fc fusion: 



1318-50 

CGC TCA GCC CGG GTA AAA TGG AAA CGT TGC CTC CAA AAT ACC 
TGC (SEQIDNO: 109) 



1318-51 

CCA TTT TAC CCG GGC TGA GCG AGA GGC TCT TCT GCG TGT 
(SEQIDNO: 110) 

35 miiOPG [22-40n/Fc J 

Expression of the fusion peptide was detected on a Coomassie stained gel and on a Western blot. The amount of 
recombinant product was less than the OPG fusion proteins having the Fc region in the N terminal position. Obvious 
inclusion bodies were not detected. Most of the product appeared to be in the insoluble (pellet) fraction. The following 
40 primers were used to construct this OPG-Fc fusion: 

1318-54 

45 GAA AAT AAG CTG CTT AGC TGC AGC TGA ACC AAA ATC 

(SEQ ID NO: HI) 

so 1318-55 

CAG CTG CAG CTA AGC AGC TTA TTT TCA CGG ATT G 

(SEQ ID NO: 112) 



huOPG r22-401VFc 

Expression of the fusion peptide was not detected on a Coomassie stained gel, although a faint Western positive 
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signal was present. Obvious inclusion bodies were not detected. The following primers were used to prepare this OPG- 
Fc fusion: 

5 1318-52 

AAA AAT AAG CTG CTT AGC TGC AGC TGA ACC AAA ATC 
(SEQ ID NO: 113) 

10 

1318-53 

CAG CTG CAG CTA AGC AGC TTA TTT TTA CTG ATT GG 
is (SEQ ID NO: 114) 

V. Human OPG metf22-40n-Fc fusion (P25A) 

-This construct combines a proline to alanine amino acid change at position 25 (P25A) with the huOPG met[22-401 ]- 
20 Fc fusion. The plasmid was digested with restriction endonucleases Clal and Kpnl, which removes the N-terminal 28 
codons of the gene, and the resulting small (less than 200 base pair) fragment was gel purified. Thisf ragment containing 
the proline to alanine change was then ligated into plasmid pAMG21-huOPG [22-401 ]-Fc fusion which had been di- 
gested with the two restriction endonucleases. The ligated DNA was transformed into competent host cells of E. coli 
strain 393. Clones were screened for the ability to produce the recombinant protein product and to possess the gene 
25 fusion having the correct nucleotide sequence. Protein expression levels were determined from 50 ml shaker flask 
studies. Whole cell lysate and sonic pellet were analyzed for expression of the construct by Coomassie stained PAGE 
gels and Western analysis with murine anti-OPG antibody. The expression level of the fusion peptide was detected on 
a Coomassie stained PAGE gel and on a Western blot. The protein was in the insoluble (pellet) fraction. The cells had 
large inclusion bodies. 

30 

W. Human OPG metf22-401l (P25A1 

A DNA sequence coding for an N-terminal methionine and amino acids 22 through 401 of human OPG with the 
proline at position 25 being substituted by alanine under control of the lux PR promoter in prokaryotic expression vector 

35 pAMG21 was constructed as follows: Synthetic oligos # 1289-84 and 1289-85 were annealed to form an oligo linker 
duplex with Xbal and Kpnl cohesive ends: The synthetic linker duplex utilized optimal E. coii codons and encoded an 
N-terminal methionine. The linker also included an Spe! restriction site which was not present in the original sequence. 
The linker duplex was directionaily inserted between the Xbal and Kpnl sites in pAMG21 -huOPG-22-401 using standard 
methods. The ligation mixture was introduced into E. colj host GM221 by transformation. Clones were initially screened 

40 for production of the recombinant protein. Plasmid DNA was isolated from positive clones and DNA sequencing was 
performed to verify the DNA sequence of the HuOPG-Met[22-401] (P25A) gene. The following oligonucleotides were 
used to generate the Xbal - Kpnl linker: 

45 Oligo #1289-84 

5" -CTA GAA GGA GGA ATA ACA TAT GGA AAC TTT TGC TCC 
AAA ATA TCT TCA TTA TGA TGA AGA AAC TAG TCA TCA GCT GCT 
50 GTG TGA TAA ATG TCC GCC GGG TAC -3 1 (SEQ ID NO: 115) 



55 
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Oligo #1289-85 

5'- CCG GCG GAC ATT TAT CAC ACA GCA GCT GAT GAC TAG 
s TTT CTT CAT CAT aa T GAA GAT ATT TTG GAG CAA AAG TTT CCA 

TAT GTT ATT CCT CCT T-3 1 (SEQ ID NO: 116) 

10 X. Human OPG metr22-4011 (P26A) and (P26D) 

A DNA sequence coding for an N-terminal methionine and amino acids 22 through 401 of human OPG with the 
proline at position 26 being substituted by alanine under control of the lux PR promoter in prokaryotic expression vector 
pAMG21 was constructed as follows: Synthetic oligos # 1289-86 and 1289-87 were annealed to form an oligo linker 

is duplex with Xbal and Spel cohesive ends. The synthetic linker duplex utilized optimal E. coy codons and encoded an 
N-terminal methionine. The linker duplex was directionally inserted between the Xbal and Spel sites in p AMG21 -huOPG 
[22-401 ] (P25A) using standard methods. The ligation mixture was introduced into E. coli host GM221 by transformation. 
Clones were initially screened for production of the recombinant protein. Plasmid DNA was isolatedfrom positive clones 
and DNA sequencing was performed to verify the DNA sequence of the huOPG-met[22-401] (P26A) gene. One of the 

20 clones sequenced was found to have the proline at position 26 substituted by aspartic acid rather than alanine, and 
this clone was designated huOPG-met[22-401] (P26D). The following oligonucleotides were used to generate the Xbal 
- Spel linker: 

25 Oligo #1289-86 

5' - CTA GAA GGA GGA ATA ACA TAT GGA AAC TTT TCC 
TGC TAA ATA TCT TCA TTA TGA TGA AGA AA - 3 1 (SEQ ID NO: 117) 



30 



35 



40 



Oligo #1289-87 

5" - CTA GTT TCT TCA TCA TAA TGA AGA TAT TTA GCA 
GGA AAA GTT TCC ATA TGT TAT TCC TCC TT - 3 \ 
(SEQ ID NO: 118) 



Y. Human OPG met[22-1941 (P25A) 

A DNA sequence coding for an N-terminal methionine and amino acids 22 through 194 of human OPG with the 
45 proline at position 25 being substituted by alanine under control of the lux P R promoter in prokaryotic expression vector 
pAMG21 was constructed as follows: The plasmids P AMG21-huOPG[27-194] and p AMG 21 -huOPG [22-401] (P25A) 
were each digested with Kpnfand BamHI endonucleases. The 450 bp fragment was isolated from pAMG21-huOPG 
[27-194] and the 6.1 kbp fragment was isolated from pAMG21 -huOPG[22-401] (P25A). These fragments were I i gated 
together and introduced into E.coli host GM221 by transformation. Clones were initially screened for production of the 
50 recombinant protein. Plasmid DNA was isolated from positive clones and DNA sequencing was performed to verify 
the DNA sequence of the huOPG-Met[22-1 94] (P25A) gene. 

EXAMPLE 9 

55 Association of OPG Monomers 

CHO cells engineered to overexpress muOPG [22-401] were used to generate conditioned media for the analysis 
of secreted recombinant OPG using rabbit polyclonal anti-OPG antibodies. An aliquot of conditioned media was con- 
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centrated 20-fold, then analysed by reducing and non-reducing SDS-PAGE (Figure 15). Under reducing conditions, 
the protein migrated as a Mr 50-55 kd polypeptide, as would be predicted if the mature product was glycosylated at 
one or more of its consensus N-linked glycosylation sites. Suprisingly, when the same samples were analysed by non- 
reducing SDS-PAGE, the majority of the protein migrated as an approximately 100 kd polypeptide, twice the size of 
s the reduced protein. In addition, there was a smaller amount of the Mr 50-55 kd polypeptide. This pattern of migration 
on SDS-PAGE was consistent with the notion that the OPG product was forming dimers through oxidation of a free 
sulfhydryl group(s). 

The predicted mature OPG polypeptide contains 23 cysteine residues, 18 of which are predicted to be involved in 
forming intrachain disulfide bridges which comprise the four cysteine-rich domains (Figure 12A). The five remaining 

to C-terminal cysteine residues are not involved in secondary structure which can be predicted based upon homology 
with other TNFR family members. Overall there is a net uneven number of cysteine residues, and it is formally possible 
that at least one residue is free to form an intermolecular disulfide bond between two OPG monomers. 

To help elucidate patterns of OPG kinesis and monomer association, a pulse-chase labelling study was performed. 
CHO cells expressing muOPG [22-401] were metabolically labelled as described above in serum-free medium 

is containing 35 S methionine and cysteine for 30 min. After this period, the media was removed, and replaced with com- 
plete medium containing unlabelled methionine and cysteine at levels approximately 2,000-fold excess to the original 
concentration of radioactive amino acids. At 30 min, 1 hr, 2 hr, 4 hr, 6 hr and 1 2 hr post addition, cultures were harvested 
by the removal of the conditioned media, and lysates of the conditioned media and adherent monolayers were prepared. 
The culture media and cell lysates were clarified as described above, and then immunoprecipitated using anti-OPG 

20 antibodies as described above. After the immunoprecipitates were washed, they were released by boiling in non- 
reducing SDS-PAGE buffer then split into two equal halves. To one half, the reducing agent (3-mercaptothanol was 
added to 5% (v/v) final concentration, while the other half was maintained in non-reducing conditions. Both sets of 
immunoprecipitates were analysed by SDS-PAGE as described above, then processed for autoradiography and ex- 
posed to film. The results are shown in Figure 16. The samples analysed by reducing SDS-PAGE are depicted in the 

25 bottom two panels. After synthesis, the OPG polypeptide is rapidly processed to a slightly larger polypeptide, which* 
probably represents modification by N-linked g lycos lyat ion. After approximately 1 -2 hours, the level of OPG in the cell 
decreases dramatically, and concomitantly appears in the culture supernatant. This appears to be the result of the 
vectoral transport of OPG from the cell into the media over time, consistent with the notion that OPG is a naturally 
secreted protein. Analysis of the same immunoprecipitates under nonreducing conditions reveals the relationship be- 

30 tween the formation of OPG dimers and secretion into the conditioned media (Figure 16, upper panels). In the first 
30-60 minutes, OPG monomers are processed in the cell by apparent glycoslylation, followed by dimer formation. Over 
time, the bulk of OPG monomers are driven into dimers, which subsequently disappear from the cell. Beginning about 
60 minutes after synthesis, OPG dimers appear in the conditioned media, and accumulate over the duration of the 
experiment. Following this period, OPG dimers are formed, which are then secreted into the culture media. OPG mon- 

35 omers persist at a low level inside the cell over time, and small amounts also appear in the media. This does not appear 
to be the result of breakdown of covalent OPG dimers, but rather the production of sub-stoichiornetric amounts of 
monomers in the cell and subsequent secretion. 

Recombinantly produced OPG from transfected CHO cells appears to be predominantly a dimer. To determine if 
dimerization is a natural process in OPG synthesis, we analysed the conditioned media of a cell line found to naturally 

40 express OPG. The CTLL-2 cell line, a murine cytotoxic T lymphocytic cell line (ATCC accession no. TIB-214), was 
found to express OPG mRNA in a screen of tissue and cell line RNA. The OPG transcript was found to be the same 
as the cloned and sequenced 2.5-3.0 kb RNA identified from kidney and found to encode a secreted molecule. Western 
blot analysis of conditioned media obtained from CTLL-2 cells shows that most, if not all, of the OPG protein secreted 
is a dimer (Figure 17). This suggests that OPG dimerization and secretion is not an artifact of overexpression in a ceil 

45 line, but is likely to be the main form of the product as it is produced by expressing cells. 

Normal and transgenic mouse tissues and serum were analysed to determine the nature of the OPG molecule 
expressed in OPG transgenic mice. Since the rat OPG cDNA was expressed under the control of a hepatocyte control 
element, extracts made from the parenchyma of control and transgenic mice under non-reducing conditions were 
analysed (Figure 18). In extract from transgenic, but not control mice, OPG dimers are readily detected, along with 

50 substoichiometric amounts of monomers. The OPG dimers and monomers appear identical to the recombinant murine 
protein expressed in the genetically engineered CHO ceils. This strongly suggests that OPG dimers are indeed a 
natural form of the gene product, and are likely to be key active components. Serum samples obtained from control 
and transgenic mice were similarly analysed by western blot analysis. In control mice, the majority of OPG protein 
migrates as a dimer, while small amounts of monomer are also detected. In addition, significant amounts of a larger 

55 OPG related protein is detected, which migrates with a relative molecular mass consistent with the predicted size of a 
covalently-linked trimer. Thus, recombinant OPG is expressed predominantly as a dimeric protein in OPG transgenic 
mice, and the dimer form may be the basis for the osteopetrotic phenotype in OPG mice. OPG recombinant protein 
may also exist in higher molecular weight "trimeric" forms. 
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To determine if the five C-terminal cysteine residues of OPG play a role in homodimerization, the murine OPG 
codons for cytsteine residues 195 (C195), C202, C277, C319, and C400 were changed to serine using the Quick- 
Change™ Site-Directed Mutagenesis Kit (Stratagene, San Diego, CA) as described above. The muOPG gene was 
subcloned between the Not I and Xba I sites of the pcDNA 3.1 (+) vector (Invitrogen, San Diego, CA). The resulting 
5 plasmid, pcDNA3.1-muOPG, and mutagenic primers were treated with Pfu polymerase in the presence of deoxynu- 
cleotides, then amplified in a thermocycler as described above. An aliqout of the reaction is then transfected into 
competent E. coli XL1 -Blue by heatshock, then plated. Plasmid DNA from transformants was then sequenced to verify 
mutations. 

The following primer pairs were used to change the codon for cysteine residue 1 95 to serine of the murine OPG 
10 gene, resulting in the production of a muOPG [22-401] C1 95S protein: 



1389-19: 

5' -CAC GCA AAA GTC GGG AAT AGA TGT CAC- 3 ' (SEQ ID NO: 150) 



1406-38: 

5' -GTG ACA TCT ATT CCC GAC TTT TGC GTG-3' (SEQ ID NO: 151) 

20 

The following primer pairs were used to change the codon for cysteine residue 202 to serine of the murine OPG 
gene, resulting in the production of a muOPG [22-401] C202S protein: 

25 

1389-21: 

5' -CAC CCT GTC GGA AGA GGC CTT CTT C-3' (SEQ ID NO: 152) 

30 

1389-22: 

5' -GAA GAA GGC CTC TTC CGA CAG GGT G-3' (1389-22) 

(SEQ ID NO: 153) 

35 

The following primer pairs were used to change the codon for cysteine residue 277 to serine of the murine OPG 
gene, resulting in the production of a muOPG [22-401] C277S protein: 



40 1389-23: 

5' -TGA CCT CTC GGA AAG CAG CGT GCA-3' (SEQ ID NO: 154) 

45 1389-24: 

5' -TGC ACG CTG CTT TCC GAG AGG TCA- 3' (SEQ ID NO: 155) 

The following primer pairs were used to change the codon for cysteine residue 31 9 to serine of the murine OPG 
so gene, resulting in the production of a muOPG [22-401] C31 9S protein: 



1389-17: 

5' -CCT CGA AAT CGA GCG AGC AGC TCC-3' (SEQ ID NO: 156) 
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1389-18: 

5' -CGA TTT CGA GGT CTT TCT CGT TCT C-3'.(SEQ ID NO: 157) 

5 

The following primer pairs were used to change the codon for cysteine residue 400 to serine of the murine OPG 
gene, resulting in the production of a muOPG [22-401] C400S protein: 

10 1406-72: 

5' -CCG TGA AAA TAA GCT CGT TAT AAC TAG GAA TGG-3 '(SEQ ID NO: 158) 
1S 1406-75: 

5' -CCA TTC CTA GTT ATA ACG AGC TTA TTT TCA CGG-3 ' (SEQ ID NO: 159) 

Each resulting muOPG [22-401] plasmid containing the appropriate mutation was then transfected into human 
20 293 cells, the mutant OPG-Fc fusion protein pu rified from conditioned media as described above. The biological activity 
of each protein was assessed the in vitro osteoclast forming assay described in example 11 . Conditioned media from 
each transfectant was analysed by non-reducing SDS-PAGE and western blotting with anti-OPG antibodies. 

Mutation of any of the five C-terminal cysteine residues results in the production of predominantly (>90%) mono- 
meric 55 kd OPG molecules. This strongly suggests that the C-terminal cysteine residues together play a role in OPG # 
25 homodimerization. 

C-termihalOPG deletion mutants were constructed to map the region (s) of the OPG C-terminaldomain which are 
important for OPG homodimerization. These OPG mutants were constructed by PCR amplification using primers which 
introduce premature stop translation signals in the C-terminal region of murine OPG. The 5' oligo was designed to the 
MuOPG start codon (containing a Hindlll restriction site) and the 3' oligonucleotides (containing a stop codon and Xhol 
30 site) were designed to truncate the C-terminal region of muOPG ending at either threonine residue 200 (CT 200), 
proline 212 (CT212), glutamic acid 293 (CT-293), or serine 355 (CT-355). 

The following primers were used to construct muOPG [22-200]: 

35 1091-39:. 

5' -CCT CTG AGC TCA AGC TTC CGA GGA CCA CAA TGA ACA 
AG- 3' (SEQ ID NO: 160) 

40 

1391-91: 

5' -CCT CTC TCG AGT CAG GTG ACA TCT ATT CCA CAC TTT 
45 TGC GTG GC-3' ( 1391- 91 ) (SEQ ID NO: 161) 

The following primers were used to construct muOPG [22-212]: 

so 1091-39: 

5' -CCT CTG AGC TCA AGC TTC CGA GGA CCA CAA TGA ACA 
AG-3'(SEQIDNO:162) 
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10 



15 



25 



30 



1391-90: 

5' -CCT CTC TCG AGT CAA GGA ACA GCA AAC CTG AAG AAG 

GC -3' (SEQ ID NO: 163) 

The following primers were used to construct muOPG [22-293]: 
1091-39: 

5' -CCT CTG AGC TCA AGC TTC CGA GGA CCA CAA TGA ACA 
AG- 3' (SEQ ID NO: 164) 



1391-89: 

5'- CCT CTC TCG AGT CAC TCT GTG GTG AGG TTC GAG TGG 

20 CC- 3 ' (SEQ ID NO: 165) 

The following primers were used to construct muOPG [22-355]: 



1091-39: 

5' -CCT CTG AGC TCA AGC TTC CGA GGA CCA CAA TGA ACA 
AG- 3' (SEQ ID NO: 166) 



1391-88: 

5' CCT CTC TCG AGT CAG GAT GTT TTC AAG TGC TTG AGG GC-3' 

35 (SEQ ID NO: 167) 

Each resulting muOPG-ct plasmid containing the appropriate truncation was then transf ected into human 293cells 
the mutant OPG-Fc fusion protein purified from conditioned media as described above. The biological activity of each 
40 protein was assessed the in vitro osteoclast forming assay described in example 11 . The condrtioned med.as were 
also analysed by non-reducing SDS-PAGE and western blotting using anti-OPG antibodies. 

Truncation of the C-terminal region of OPG effects the ability of OPG to form homodimers. CT 355 is predominantly 
monomeric although some dimer is formed. CT 293 forms what appears to be equal molar amounts of monomer and 
dimer, and also high molecular weight aggregates. However, CT 212 and CT 200 are monomeric. 

45 

EXAMPLE 10 

Purification of OPG 

so A. Purification of mammalian OPG-Fc Fusion Proteins 

5 L of conditioned media from 293 cells expressing an OPG-Fc fusion protein were prepared as follows. A frozen 
sample of cells was thawed into 10 ml of 293S media (DMEM-high glucose, 1x L-glutamine, 10% heat inactivated fetal 
bovine serum (FBS) and 100 ug/ml hygromycin) and fed with fresh media after one day. After three days, cells were 
ss split into two T1 75 flasks at 1 : 1 0 and 1 :20 dilutions. Two additional 1:10 splits were done to scale up to 200 T175 flasks. 
Cells were at 5 days post-thawing at this point. Cells were grown to near confluency (about three days) at which time 
serum-containing media was aspirated, cells were washed one time with 25 ml PBS per flask and 25 ml of SF media 
(DMEM-high glucose, 1x L-glutamine) was added to each flask. Cells were maintained at 5% C02 for three days at 
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which point the media was harvested, centrifuged, and filtered through 0.45m cellulose nitrate filters (Corning). 

OPG-Fc fusion proteins were purified using a Protein G Sepharose column (Pharmacia) equilibrated in PBS. The 
column size varied depending on volume of starting media. Conditioned media prepared as described above was 
loaded onto the column, the column washed with PBS, and pure protein eluted using 1 0OmM glycine pH 2.7. Fractions 
5 were collected into tubes containing 1M Tris pH 9.2 in order to neutralize as quickly as possible. Protein containing 
fractions were pooled, concentrated in either an Amicon Centricon 10 or Centriprep 10 and diafiltered into PBS. The 
pure protein is stored at -80°C. 

Murine [22-401 ]-Fc, Murine [22-180]-Fc, Murine [22-194]-Fc, human [22-401 ]-Fc and human [22-201 ]Fc were pu- 
rified by this procedure. Murine [22-185]-Fc is purified by this procedure. 
10 . - 

B. Preparation of anti-OPG antibodies 

Three New Zealand White rabbits (5-8 lbs initial wt) were injected subcutaneousiy with muOPG[22- 401]-Fc fusion 
protein. Each rabbit was immunized on day 1 with 50 ug of antigen emulsified in an equal volume of Freunds complete 

15 adjuvant. Further boosts (Days 14 and 28) were performed by the same procedure with the substitution of Freunds 
incomplete adjuvant. Antibody titers were monitored by El A. After the second boost, the antisera revealed high antibody 
titers and 25ml production bleeds were obtained from each animal. The sera was first passed over an affinity column 
to which murine OPG-Fc had be immobilized. The anti-OPG antibodies were eluted with Pierce Gentle Elution Buffer 
containing 1% glacial acetic acid. The eluted protein was then dialyzed into PBS and passed over a Fc column to 

20 remove any antibodies specific for the Fc portion of the OPG fusion protein. The run through fractions containing anti- 
OPG specific antibodies were dialyzed into PBS. 

C. Purification of murine OPGr22-4011 

25 Antibody Affinity Chromatography 

Affinity purified anti-OPG antibodies were diafiltered into coupling buffer (0.1 M sodium carbonate pH 8.3, 0.5M 
NaCl), and mixed with CNBr-activated sepharose beads (Pharmacia) for two hours at room temperature. The resin 
was then washed with coupling buffer extensively before blocking unoccupied sited with 1 M ethanolamine (pH 8.0) for 

30 two hours at room temperature. The resin was then washed with low pH (0.1M sodium acetate pH 4.0, 0.5M NaCl) 
followed by a high pH wash (0. 1 M Tris-HCI pH 8.0, 0.5M NaCl). The last washes were repeated three times. The resin 
was finally equilibrated with PBS before packing into a column. Once packed, the resin was washed with PBS. A blank 
elution was performed with 0.1 M glycine-HCI, pH 2.5), followed by re-equilibration with PBS. 

Concentrated conditioned media from CHO cells expressing muOPG[22-41 0] was applied to the column at a low 

35 flow rate. The column was washed with PBS until UV absorbance measured at 280hm returned to baseline. The protein 
was eluted from the column first with 0.1 M glycine-HCI (pH 2.5), re-equilibrated with PBS, and eluted with a second 
buffer (0.1M CAPS, pH 10.5), 1M NaCl). The two elution pools were diafiltered separately into PBS and sterile filtered 
before freezing at -20° C. 

40 Conventional Chromatography 

CHO cell conditioned media was concentrated 23x in an Amicon spiral wound cartridge (S10Y10) and diafiltered 
into 20mM tris pH 8.0. The diafiltered media was then applied to a Q-sepharose HP (Pharmacia) column which had 
been equilibrated with 20mM tris pH 8.0. The column was then washed until absorbence at 280nm reached baseline. 
45 Protein was eluted with a 20 column volume gradient of 0-300mM NaCl in tris pH 8.0. OPG protein was detected using 
a western blot of column fractions. 

Fractions containing OPG were pooled and brought to a final concentration of 300mM NaCl, 0.2mM DTT A NiNTA 
superose (Qiagen) column was equilibrated with 20mM tris pH 8.0, 300mM NaCl, 0.2mM DTT after which the pooled 
fractions were applied. The column was washed with equilibration buffer until baseline absorbence was reached. Pro- 
so teins were eluted from the column with a 0-30mM Imidazole gradient in equilibration buffer. Remaining proteins were 
washed off the column with 1 M Imidazole. Again a western blot was used to detect OPG containing fractions. 

Pooled fractions from the NiNTA column were dialyzed into 1 0mm potassium phosphate pH 7.0, 0.2mM DTT The 
dialyzed pool was then applied to a ceramic hydroxyapatite column (Bio-Rad) which had been equilibrated in 10mM 
phosphate buffer. After column washing, the protein was eluted with a 10-100mM potassium phosphate gradient over 
55 20 column volumes. This was then followed by a 20 column volume gradient of 100-400 mM phosphate. 

OPG was detected by coomassie blue staining of SDS-polyacryiamide gels and by western blotting. Fractions 
were pooled and diafiltered onto PBS and frozen at -80°C. The purified protein runs as a monomer and will remain so 
after diafiltration into PBS. The monomer is stable when stored frozen or at pH 5 at 4°C. However if stored at 4°C in 
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PBS, dimers and what appears to be trimers and tetramers will form after one week. 
D. Purification of human OPG metf 22-4011 from E. coli 

s The bacterial cell paste was suspended into 10 mM EDTA to a concentration of 15% (w/v) using a low shear 

homogenizer at 5°C. The cells were then disrupted by two homogenizations at 15,000 psi each at 5°C. The resulting 
homogenate was centrifuged at 5,000 x g for one hour at 5°C. The centrifugal pellet was washed by low shear homog- 
enization into water at the original homogenization volume followed by centrifugation as before. The washed pellet 
was then solubilized to 15% (w/v) by a solution of (final concentration) 6 M guanidine HCl, 10 mM dithiothreitol, 10 mM 

10 TrisHCI pH 8 5 at ambient temperature for 30 minutes. This solution was diluted 30-fold into 2M urea containing 50 
mM CAPS, pH 10.5, 1 mM reduced glutathione and then stirred for 72 hours at 5°C. The OPG was purified from this 
solution at 25°C by first adjustment to pH 4.5 with acetic acid and then chromatography over a column of SP-HP 
Sepharose resin equilibrated with 25 mM sodium acetate, pH 4.5. The column elution was carried out with a linear 
sodium chloride gradient from 50 mM to 550 mM in the same buffer using 20 column volumes at a flow rate of 0.1 

75 column volumes/minute. The peak fractions containing only the desired OPG form were pooled and stored at 5°C or 
buffer exchanged into phosphate buffered saline, concentrated by ultrafiltration, and then stored at 5°C. This material 
was analyzed by reverse phase HPLC, SDS-PAGE, limulus amebocyte lysate assay for the presence of endotoxin, 
and N-terminal sequencing. In addition, techniques such as mass spectrometry, pH/temperature stability, fluoresence, 
circular dichroism, differential scanning calorimetry, and protease profiling assays may also be used to examine the 

20 folded nature of the protein. 

EXAMPLE 11 

Biological Activity of Recombinant OPG 



25 



Based on histology and histomorphometry, it appeared that hepatic overexpression of OPG in transgenic mice 
markedly decreased the numbers of osteoclasts leading to a marked increase in bone tissue (see Example 4). To gain 
further insight into potential mechanism(s) underlying this in vivo effect, various forms of recombinant OPG have been 
tested in an in vitro culture model of osteoclast formation (osteoclast forming assay). This culture system was originally 
30 devised by Udagawa (Udagawa et al. Endocrinologyl25, 1 805-1 813(1 989), Proc. Natl. Acad. Sci. USA 87, 7260-7264 
(1990)) and employs a combination of bone marrow cells and cells from bone marrow stromal cell lines. A description 
of the modification of this culture system used for these studies has been previously published (Lacey et al. Endo- 
crinology 136 2367-2376 (1995)). In this method, bone marrow cells, flushed from the femurs and fcbiae of mice, are 
cultured overnight in culture media (alpha MEM with 10% heat inactivated fetal bovine serum) supplemented with 500 
35 u/ml GSF-1 (colony stimulating factor 1 , also called M-CSF), a hematopoietic growth factor specific for cells of the 
monocyte/macrophage family lineage. Following this incubation, the non-adherent cells are collected, subjected to 
gradient purification, and then cocultured with ceils from the bone marrow cell line ST2 (1 x 10 6 non-adherent cells : 
1 x 105 sT2 cells/ ml media). The media is supplemented with dexamethasone (100 nM) and the biologically-active 
metabolite of vitamin D3 known as 1 ,25 dihydroxyvitamin D3 (1 ,25 (OH)2 D3, 10 nM). To enhance osteoclast appear- 
40 ance prostaglandin E2 (250 nM) is added to some cultures. The coculture period usually ranges from 8 - 10 days and 
the media with all of the supplements freshly added, is renewed every 3-4 days. At various intervals, the cultures are 
assessed for the presence of tartrate acid phosphatase (TRAP) using either a histochemical stain (Sigma Kit # 387 A, 
Sigma St Louis, MO) or TRAP solution assay. The TRAP histochemical method allows for the identification of oste- 
oclasts phenotypically which are multinucleated (> 3 nuclei) cells that are also TRAP+. The solution assay involves 
45 lysing the osteoclast-containing cultures in a citrate buffer (100 mM, pH 5.0) containing 0.1% Triton X-100. Tartrate 
resistant acid phosphatase activity is then measured based on the conversion of p-nitrophenylphosphate (20 nM) to 
p-nitrophenol in the presence of 80 mM sodium tartrate which occurs during a 3-5 minute incubation at RT. The reaction 
is terminated by the addition of NaOH to a final concentration of 0.5 M. The optical density at 405 nm is measured and 
the results are plotted. 

so Previous studies (Udagawa et al. jbid) using the osteoclast forming assay have demonstrated that these cells 

express receptors for 125 l -calcitonin (autoradiography) and can make pits on bone surfaces, which when combined 
with TRAP positivity confirm that the multinucleated cells have an osteoclast phenotype. Additional evidence in support 
of the osteoclast phenotype of the multinucleated cells that arise in vitro in the osteoclast forming assay are that the 
cells express av and p3 integrins by immunocytochemistry and calcitonin receptor and TRAP mRNA by in situ hybnd- 

ss ization (ISH). . . 

The huOPG [22-401 ]-Fc fusion was purified from CHO cell conditioned media and subsequently utilized in the 
osteoclast forming assay. At 1 00 ng/m! of huOPG [22-401 ]-Fc, osteoclast formation was virtually 1 00% inhibited (Figure 
19A). The levels of TRAP measured in lysed cultures in microtitre plate wells were also inhibited in the presence of 
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OPG with an ID 50 of approximately 3 ng/ml (Figure 20). The level of TRAP activity in lysates appeared to correlate 
with the relative number of osteoclasts seen by TRAP cytochemistry (compare Figures 19A-19G and 20). Purified 
human lgG1 and TNFbp were also tested in this model and were found to have no inhibitory or stimulatory effects 
suggesting that the inhibitory effects of the huOPG [22-401 ]-Fc were due to the OPG portion of the fusion protein. 
5 Additional forms of the human and murine molecules have been tested and the cumulative data are summarized in 
Table 1 . 



10 



Table 1 

Effects of various OPG forms on in vitro 
osteoclast formation 



15 



25 



30 



35 



40 



45 



50 



. QPG Construct 



Relative Bioactivity in vitro 



20 



muOPG [22-401] -Fc 
muOPG [22-194] -Fc 



+++ 
+++ 
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55 



BNSDOCID: <EP 0784093A1_I_> 



EP 0 784 093 A1 



10 



15 



20 



muOPG [22-185] -Fc 
muOPG [22-180] -Fc 
muOPG [22-401] 
muOPG [22-401] C195 
muOPG [22-401] C202 
muOPG [22-401] C277 
muOPG [22-401] C319 
muOPG [22-401] C400 
muOPG [22-185] 
muOPG [22-194] 
muOPG [22-200] 
muOPG [22-212] 
muOPG [22-2 93] 
muOPG [22-355] 



++ 

+++ 
+++ 
+ 

+ 
+ 

++ 

+++ 
+++ 



25 



30 



35 



huOPG [22-401]-Fc 
huOPG [22-201] -Fc 
huOPG [22-401] -Fc P26A 
huOPG [22-401 ]-Fc Y28F 
huOPG [22-401] 
huOPG [27-401] -Fc 
huOPG [29-401] -Fc 
huOPG [32-401] -Fc 



+++ 

+++ 

+++ 

+++ 

+++ 

++ 

++ 

+ /- 



40 



45 



50 



55 



+++, ED 50 = 0.4-2 ng/ml 



++/ 



ED 50 = 2-10 ng/ml 



ED 50 = 10-100 ng/ml 



ED 50 > 100 ng/ml 



The cumulative data suggest that murine and human OPG amino acid sequences 22-401 are fully active in vitro, 
when either fused to the Fc domain, or unfused. They inhibit in a dose<lependent manner and possess half-maximal 
activities in the 2-10 ng/ml range. Truncation of the murine C-terminus at threonine residue 180 inactivates the molecule, 
whereas truncations at cysteine 185 and beyond have full activity. The cysteine residue located at position 185 is 
predicted to form an SS3 bond in the domain 4 region of OPG. Removal of this residue in other TNFR-related proteins 
has previously been shown to abrogate biological activity (Yan et al. J. Biol. Chem. 266, 12099-12104 (1994)). Our 
finding that muOPG[22-180]-Fc is inactive while muOPG[22-185]-Fc is active is consistent with these findings. This 
suggests that amino acid residues 22-185 define a region for OPG activity. 

These findings indicate that like transgenically-expressed OPG, recombinant OPG protein also suppressed oste- 
oclast formation as tested in the osteoclast forming assay. Time course experiments examining the appearance of 
TRAP+ cells, p3+ cells, F480+ cells in cultures continuously exposed to OPG demonstrate that OPG blocks the ap- 
pearance TRAP+ and p3+ cells, but not F480+ cells. In contrast, TRAP+ and p3+ cells begin to appear as early as 
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day 4 following culture establishment in control cultures. Only F480+ cells can be found in OPG-treated cultures and 
they appear to be present at qualitatively the same numbers as the control cultures. Thus, the mechanism of OPG 
effects in vitro appears to involve a blockade in osteoclast differentiation at a step beyond the appearance of monocyte- 
macrophages but before the appearance of cells expressing either TRAP or P3 integrins. Collectively these findings 

5 indicate that OPG does not interfere with the general growth and differentiation of monocyte -macrophage precursors 
from bone marrow, but rather suggests that OPG specifically blocks the selective differentiation of osteoclasts from 
monocyte-macrophage precursors. 

To determine more specifically when in the osteoclast differentiation pathway that OPG was inhibitory, a variation 
of the in vitro culture method was employed. This variation, described in (Lacey et al. supra ), employs bone marrow 

10 macrophages as osteoclast precursors. The osteoclast precursors are derived by taking the nonadherent bone marrow 
cells after an overnight incubation in CSF-1/M-CSF, and culturing the cells for an additional 4 days with 1 ,000 - 2,000 
U/ml CSF-1 . Following 4 days of culture, termed the growth phase, the non-adherent cells are removed. The adherent 
cells, which are bone marrow macrophages, can then be exposed for up to 2 days to various treatments in the presence 
of 1,000 - 2,000 U/ml CSF-1. This 2 day period is called the intermediate differentiation period. Thereafter, the cell 

is layers are again rinsed and then ST-2 celts (1 X 10 5 cell/ml), dexamethasone (100 nM) and 1 ,25 (OH)2 D3 (10 nM) 
are added for the last 8 days for what is termed the terminal differentiation period. Test agents can be added during 
this terminal period as well. Acquisition of phenotypic markers of osteoclast differentiation are acquired during this 
terminal period (Lacey et al. ibid ). 

huOPG [22-401 ]-Fc (100 ng/ml) was tested for its effects on osteoclast formation in this model by adding it during 

20 either the intermediate, terminal or, alternatively, both differentiation periods. Both TRAP cytochemistry and solution 
assays were performed. The results of the solution assay are shown in Figure 21. HuOPG [22-401 ]-Fc inhibited the 
appearance of TRAP activity when added to both the intermediate and terminal or only the terminal differentiation 
phases. When added to the intermediate phase and then removed from the cultures by rinsing, huOPG [22-401]-Fc 
did not block the appearance of TRAP activity in culture lysates. The cytochemistry results parallel the solution assay 

2S data. Collectively, these observations indicate that huOPG [22-401]-Fc only needs to be present during the terminal* 
differentiation period for it to exert its all of its suppressive effects on osteoclast formation. 

B. In vivo I L1 -a and IL1 -B challenge experiments 

30 | LI increases bone resorption both systemically and locally when injected subcutaneously over the calvaria of 

mice (Boyce et al., Endocrinology 1 25 , 1142-1150 (1989)). The systemic effects can be assessed by the degree of 
hypercalcemia and the local effects histologically by assessing the relative magnitude of the osteoclast-mediated re- 
sponse. The aim of these experiments was to determine if recombinant muOPG [22-401 ]-Fc could modify the local 
and/or systemic actions of IL1 when injected subcutaneously over the same region of the calvaria as IL1 . 

35 

1L-1 & experiment 

Male mice (ICR Swiss white) aged 4 weeks were divided into the following treatment groups (5 mice per group): 
Control group: IL1 treated animals (mice received 1 injection/day of 2.5 ug of IL1-P); Low dose muOPG [22-401]-Fc 

40 treated animals (mice received 3 injections/day of 1 jig of muOPG [22-401 ]-Fc); Low dose muopg [22-401 ]-Fc and 
IL1-P; High dose muOPG [22-401]-Fc treated animals (mice receive 3 injections/day of 10 jig muOPG [22-401]-Fc); 
High dose muOPG [22-401 ]-Fc and IL1-p. All mice received the same total number of injections of either active factor 
or vehicle (0.1% bovine serum albumin in phosphate buffered saline). All groups are sacrificed on the day after the 
last injection. The weights and blood ionized calcium levels are measured before the first injections, four hours after 

45 the second injection and 24 hours after the third IL1 injection, just before the animals were sacrificed. After sacrifice 
the calvaria were removed and processed for paraffin sectioning. 

IL1-a experiment 

so Male mice (ICR Swiss white) aged 4 weeks were divided into the following treatment groups (5 mice per group): 

Control group; IL1 alpha treated animals (mice received 1 injection/day of 5 ug of I L1 -alpha); Low dose muOPG 
[22-401 ]-Fc treated animals (mice received 1 injection/day of 10 uxj of muOPG [22-401 ]-Fc; Low dose muopg [22-401]- 
Fc and IL1 -alpha, (dosing as above); High dose muopg [22-401 ]-Fc treated animals (mice received 3 injections/day 
of 10 p.g muOPG [22-401 ]-Fc; High dose muOPG [22-401 ]-Fc and IL1-a. AN mice received the same number of injec- 
ts tions/day of either active factor or vehicle. All groups were sacrificed on the day after the last injection. The blood 
ionized calcium levels were measured before the first injection, four hours after the second injection and 24 hours after 
the third IL1 injection, just before the animals were sacrificed. The animal weights were measured before the first 
injection, four hours after the second injection and 24 hours after the third IL1 injection, just before the animals were 
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sacrificed. After sacrifice the calvaria were removed and processed for paraffin sectioning. 



Histological methods 

5 Calvarial bone samples were fixed in zinc formalin, decalcified in formic acid, dehydrated through ethanol and 

mounted in paraffin. Sections (5pm thick) were cut through the calvaria adjacent to the lambdoid suture and stained 
with either hematoxylin and eosin or reacted for tartrate resistant acid phosphatase activity (Sigma Kit# 387A) and 
counterstained with hematoxylin. Bone resorption was assessed in the IL1 -a treated mice by histomorphometric meth- 
ods using the Osteomeasure (Osteometries, Atlanta, G A) by tracing histologic features onto a digitizer platen using a 

10 microscope-mounted camera lucida attachment. Osteoclast numbers, osteoclast lined surfaces, and eroded surfaces 
were determined in the marrow spaces of the calvarial bone. The injected and non-injected sides of the calvaria were 
measured separately. 



Results 



15 



IL1-a and IL1-p produced hypercalcemia at the doses used, particularly on the second day, presumably by the 
induction of increased bone resorption systemically. The hypercalcemic response was blocked by muOPG [22-401]- 
Fc in the ILl-beta treated mice and significantly diminished in mice treated with IL1 -alpha, an effect most apparent on 
day 2 (Figure 22A-22B). 

20 Histologic analysis of the calvariae of mice treated with IL1 -alpha and beta shows that IL1 treatments alone produce 

a marked increase in the indices of bone resorption including: osteoclast number, osteoclast lined surface, and eroded 
surface (surfaces showing deep scalloping due to osteoclastic action (Figure 23B, Table 2). In response to ILI-o or 
IL1-B the increases in bone resorption were similar on the injected and non-injected sides of the calvaria. Muopg 
[22-401 ]-Fc injections reduced bone resorption in both I L1 -alpha and beta treated mice and in mice receding vehicle 

2S alone but this reduction was seen only on the muopg [22-401 ]-Fc injected sides of the calvariae. 

The most likely explanation for these observations is that muOPG [22-401 ]-Fc inhibited bone resorpt.on, a con- 
clusion supported by the reduction of both the total osteoclast number and the percentage of available bone surface 
undergoing bone resorption, in the region of the calvaria adjacenttothe muOPG [22-401 ]-Fc injection sites. The actons 
of muOPG [22-401]-Fc appeared to be most marked locally by histology, but the fact that muOPG [22-401]-Fc also 

30 blunted IL1-induced hypercalcemia suggests that muOPG [22-401]-Fc has more subtle effects on bone resorption 
systemically. 
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C. Systemic Effects of muOPG [22-4011-Fc in Growing Mice 

Male BDF1 mice aged 3-4 weeks, weight range 9.2- 15.7g were divided into groups often mice per group. These 
mice were injected subcutaneousiy with saline or muOPG [22-401 ]-Fc 2.5mg/kg bid for 14 days (5mg/kg/day). The 
mice were radiographed before treatment, at day 7 and on day 14. The mice were sacrificed 24 hours after the final 
injection. The right femur was removed, fixed in zinc formalin, decalcified in formic acid and embedded in paraffin. 
Sections were cut through the mid region of the distal femoral metaphysis and the femoral shaft. Bone density, by 
histomorphometry, was determined in six adjacent regions extending from the metaphyseal limit of the growth plate, 
through the primary and secondary spongiosa and into the femoral diaphysis (shaft). Each region was 0.5 X 0.5 mm 2 . 

Radiographic changes 

After seven days of treatment there was evidence of a zone of increased bone density in the spongiosa associated 
with the growth plates in the OPG treated mice relative to that seen in the controls. The effects were particularly striking 
15 in the distal femoral and the proximal tibial metaphases (Figure 24A-24B). However bands of increased density were 
also apparent in the vertebral bodies, the iliac crest and the distal tibia. At 1 4 days, the regions of opacity had extended 
further into the femoral and tibial shafts though the intensity of the radio-opacity was diminished. Additionally, there 
were no differences in the length of the femurs at the completion of the experiment or in the change in length over the 
duration of the experiment implying that OPG does not alter bone growth. 



10 



20 



Histological Changes 



The distal femoral metaphysis showed increased bone density in a regions 1.1 to 2.65 mm in distance from the 
growth plate (Figures 25 and 26A-26B). This is a region where bone is rapidly removed by osteoclast-mediated bone ^ 
25 resorption in mice. In these rapidly growingyoung mice, the increase in bone in this region observed with OPG treatment * 
is consistent with an inhibition of bone resorption. 

D. Effects of Qsteoproteaerin on Bone Loss Induced by Ovariecto my in the Rat 

30 Twelve week old female Fisher rats were ovariectomized (OVX) or sham operated and dual xray absorptiometry 

(DEXA) measurements made of the bone density in the distal femoral metaphysis. After 3 days recovery period, the 
animals received daily injections for 14 days as follows: Ten sham operated animals received vehicle (phosphate 
buffered saline); Ten OVX animals received vehicle (phosphate buffered saline); Six OVX animals received OPG-Fc 
5mg/kg SC; Six OVX animals received pamidronate (PAM) 5mg/kg SC; Six OVX animals received estrogen (ESTR) 

ss 40ug/kg SC. After 7 and 1 4 days treatment the animals had bone density measured by DEXA. Two days after the last 
injection the animals were killed and the right tibia and femur removed for histological evaluation. 

The DEXA measurements of bone density showed a trend to reduction in the bone density following ovariectomy 
that was blocked by OPG-Fc. Its effects were similar to the known antiresorptive agents estrogen and pamidronate. 
(Figure 27). The histomorphometric analysis confirmed these observations with OPG-Fc treatment producing a bone 

40 density that was significantly higher in OVX rats than that seen in untreated OVX rats (Figure 28). These results confirm 
the activity of OPG in the bone loss associated with withdrawal of endogenous estrogen following ovariectomy. 

In vivo Summary 

45 The in vivo actions of recombinant OPG parallel the changes seen in OPG transgenic mice. The reduction in 

osteoclast number seen in the OPG transgenic is reproduced by injecting recombinant OPG locally over the calvaria 
in both normal mice and in mice treated with IL1-aor ILt-p. The OPG transgenic mice develop an osteopetrotic phe- 
notype with progressive filling of the marrow cavity with bone and unremodelled cartilage extending from the growth 
plates from day 1 onward after birth. In normal three week old (growing) mice, OPG treatments also led to retention 

so of bone and unremodelled cartilage in regions of endochondral bone formation, an effect observed radiographically 
and confirmed histologically. Thus, recombinant OPG produces phenotypic changes in normal animals similar to those 
seen in the transgenic animals and the changes are consistent with OPG-induced inhibition of bone resorption. Based 
on in vitro assays of osteoclast formation, a significant portion of this inhibition is due to impaired osteoclast formation. 
Consistent with this hypothesis, OPG blocks ovariectomy-induced osteoporosis in rat. Bone loss in this model is known 

ss to be mediated by activated osteoclasts, suggesting a role for OPG in treatment of primary osteoporosis. 
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EXAMPLE 12 

Pegylation Derivatives of OPG 

5 Preparation of N-terminal PEG-OPG conjugates by reductive alkylation 

HuOPG met [22-1 94] P25A was buffer exchanged into 25-50 mM NaOAc, pH 4.5-4.8 and concentrated to 2-5 mg/ 
ml. This solution was used to conduct OPG reductive alkylation with monofunctional PEG aldehydes at 5-7 °C. PEG 
monofunctional aldehydes, linear or branched, MW=1 to 57 kDa (available from Shearwater Polymers) were added to 

10 the OPG solution as solids in amounts constituting 2-4 moles of PEG aldehyde per mole of OPG. After dissolution of 
polymer into the protein solution, sodium cyanoborohydride was added to give a final concentration of 15 to 20 mM in 
the reaction mixture from 1-1.6 M freshly prepared stock solution in cold Dl water. The progress of the reaction and 
the extent of OPG PEGylation was monitored by size exclusion HPLC on a G3000SW XL column (Toso Haas) eluting 
with 100 mM NaP0 4 , 0.5 M NaCI, 10% ethanol, pH 6.9. Typically the reaction was allowed to proceed for 16-18 hours, 

15 after which the reaction mixture was diluted 6-8 times and the pH lowered to 3.5-4. The reaction mixture was fractionated 
by ion exchange chromatography (HP SP HiLoad 16/10, Pharmacia) eluting with 20 mM NaOAc pH 4 with a linear 
gradient to 0.75M NaCI over 25 column volumes at a flow rate of 30 cm/h. Fractions of mono-, di- or poly-PEGylated 
OPG were pooled and characterized by SEC HPLC and SDS-PAGE. By N-terminal sequencing/it was determined 
that the monoPEG-OPG conjugate, the major reaction product in most cases, was 98% N-terminally PEG-modified 

20 OPG. 

This procedure was generally used to prepare the following N-terminal PEG-OPG conjugates (where OPG is 
HuOPG met [22-194] P25A: 5 kD monoPEG, 10 kD mono branched PEG, 12 kD monoPEG, 20 kD monoPEG, 20 kD 
mono branched PEG, 25 kD monoPEG, 31 kD monoPEG, 57 kD monoPEG, 12kDdiPEG, 25 kD diPEG, 31 kDdiPEG, 
57 kD diPEG, 25 kD triPEG. 

25 

Preparation of PEG-OPG conjugates by acylation 

HuOPG met [22-1 94] P25A was buffer exchanged into 50 mM BICINE buffer, pH 8 and concentrated to 2-3 mg/ 
ml. This solution was used to conduct OPG acylation with monofunctional PEG N-hydroxysuccinimidyl esters at room 

30 temperature. PEG N-hydroxysuccinimidyl esters, linear or branched, MW=1 to 57 kDa (available from Shearwater 
Polymers) were added to the OPG solution as solids in amounts constituting 4-8 moles of PEG N-hydroxysuccinimidyl 
ester per mole of OPG. The progress of the reaction and the extent of OPG PEGylation was monitored by size exclusion 
HPLC on a G3000SW XL column (Toso Haas) eluting with 1 00 mM NaP0 4 , 0.5 M NaCI, 10% ethanol, pH 6.9. Typically 
the reaction was allowed to proceed for 1 hour, after which the reaction mixture was diluted 6-8 times and the pH 

35 lowered to 3.5-4. The reaction mixture was fractionated by ion exchange chromatography (HP SP HiLoad 16/10, Phar- 
macia) eluting with 20 mM NaOAc pH 4 with a linear gradient to 0.75M NaCI over 25 column volumes at a flow rate of 
30 cm/h. Fractions of mono-, di- or poly- PEGylated OPG were pooled and characterized by SEC HPLC and SDS- 
PAGE. 

This procedure was generally used to prepare the following PEG-OPG conjugates: 5 kD polyPEG, 20 kD polyPEG, 
40 40 kD poly branched PEG, 50 kD poly PEG. 

Preparation of dimeric PEG-OPG 

HuOPG met [22-194] P25A is prepared for thiolation at 1-3 mg/ml in a phosphate buffer at near neutral pH. S- 
45 acetyl mecaptosuccinic anhydride (AMSA) is added in a 3-7 fold molar excess while maintaining pH at 7.0 and the rxn 
stirred at 4°C for 2 hrs. The monothiolated-OPG is separated from unmodified and polythiolated OPG by ion exchange 
chromatography and the protected thiol deprotected by treatment with hydroxylamine. After deprotection, the hydrox- 
ylamine is removed by gel filtration and the resultant monothiolated-OPG is subjected to a variety of thiol specific 
crosslinking chemistries. To generate a disulfide bonded dimer, the thiolated OPG at >lmg/m! is allowed to undergo air 
so oxidation by dialysis in slightly basic phosphate buffer. The covalent thioether OPG dimer was prepared by reacting 
the bis-maleimide crosslinker, N,N-bis(3-maleimido propianyl) -2 -hydroxy 1 ,3 propane with the thiolated OPG at >1 mg/ 
ml at a 0.6x molar ratio of Crosslin ker:OPG in phosphate buffer at pH 6.5. Similarly, the PEG dumbbells are produced 
by reaction of substoichiometric amounts of bis-maleimide PEG crosslinkers with thiolated OPG at >1 mg/ml in phos- 
phate buffer at pH 6.5. Any of the above dimeric conjugates may be further purified using either ion exchange or size 
ss exclusion chromatographies. 

Dimeric PEG-OPG conjugates (where OPG is HuOPG met [22-194] P25A prepared using the above procedures 
include disulfide-bonded OPG dimer, covalent thioether OPG dimer with an aliphatic amine type crosslinker, 3.4 kD 
and 8kD PEG dumbbells and monobells. 
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PEG-OPG conjugates were tested for activity in vitro using the osteoclast maturation assay described in Example 
1 1 A and for activity in vivo by measuring increased bone density after injection into mice as described in Example 11 C. 
The in vivo activity is shown below in Table 3. 

Table 3 



In vivo biological activity of Pegylated OPG 



OPG Construct 



muOPG met [22-194] 
muOPG met [22-194] 5k PEG 
muOPG met [22-194] 20k PEG 

huOPG met [22-194] P25A 
huOPG met [22-194] P25A 5k PEG 
huOPG met [22-194] P25A 20k PEG 
huOPG met [22-194] P25A 31k PEG 
huOPG met [22-194] P25A 57k PEG 
huOPG met [22-194] P25A 12k PEG 
huOPG met [22-194] P25A 20k Branched PEG 
huOPG met [22-194] P25A 8k PEG dimer 
huOPG met [22-194] P25A disulfide crosslink 



Increase in Tibial Bone Density 



+ 



+ 
+ 
+ 
+ 
+ 
+ 
+ 



While the invention has been described in what is considered to be its preferred embodiments, it is not to be limited * 
to the disclosed embodiments, but on the contrary, is intended to cover various modifications and equivalents included 
within the spirit and scope of the appended claims, which scope is to be accorded the broadest interpretation so as to 
encompass all such modifications and equivalents. 

The features disclosed in the foregoing description, in the following claims and/or in the accompanying drawings 
may, both separately and in any combination thereof, be material for realising the invention in diverse forms thereof. , 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT 

(A) NAME: Amgen Inc. 

(B) STREET: 1840 Dehavilland Drive 

(C) CITY: Thousand Oaks 

(D) STATE: California 

(E) COUNTRY: United States 

(F) ZIP: 91320 

(ii) TITLE OF INVENTION: OSTEOPROTEGERIN 
(iii) NUMBER OF SEQUENCES : 168 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC -DOS /MS -DOS 

(D) SOFTWARE: Patentln Release #1.0 , Version #1 . 

(v) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 96309363.8 

(B) FILING DATE: 2 0 December 19 9 6 

(vi) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Brown, John D. 

(B) FIRM: Forrester & Boehmert 

(C) REFERENCE/DOCKET NUMBER: FB6253 -E11066EP 

(vii) ATTORNEY/ AGENT CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Forrester & Boehmert, 

(B) STREET: Franz - Joseph-Strasse 38, 

(C) CITY: D-80801 Munchen 

(D) COUNTRY: Germany 

(E) TELEX: 5242 82 FORBO D 

(F) FAX: 089 34 70 10 

(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 
AAAGGAAGGA AAAAAGCGGC CGCTACANNN NNNNNT 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
TCGACCCACG CGTCCG £ 
(2) INFORMATION FOR SEQ ID NO:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 
GGGTGCGCAG GC 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

TGTAAAACGA CGGCCAGT 18 

5 

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 18 base pairs 
10 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

CAGGAAACAG CTATGACC j 18 

(2) INFORMATION FOR SEQ ID NO : 6-: 

t 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
CAATTAACCC TCACTAAAGG 20 
(2) INFORMATION FOR SEQ ID NO : 7 : 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 base pairs 
45 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



<ii) MOLECULE TYPE: cDNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7; 
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GCATTATGAC CCAGAAACCG GAC 
(2) INFORMATION FOR SEQ ID NO:8: 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
10 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

AGGTAGCGCC CTTCCTCACA TTC 

INFORMATION FOR SEQ ID NO : 9 : 

j 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acvd 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear*' 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
GACTAGTCCC ACAATGAACA AGTGGCTGTG 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10: 
ATAAGAATGC GGCCGCTAAA CTATGAAACA GCCCAGTGAC CATTC 

so 

(2) INFORMATION FOR SEQ ID NO: 11: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GCCTCTAGAA AGAGCTGGGA C 
15 (2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 
20 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 4 

(ii) MOLECULE TYPE: cDNA 

V 

25 

(xi) SEQUENCE DESCRIPTION: SEQ "ID NO: 12: 
CGCCGTGTTC CATTTATGAG C 
30 (2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 4 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
ATCAAAGGCA GGGC AT ACT T CCTG 
(2) INFORMATION FOR SEQ ID NO: 14: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 base pairs 
so (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 
GTTGCACTCC TGTTTCACGG TCTG 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 4 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ^ID NO: 15: 
CAAGACACCT TGAAGGGCCT GATG ^ 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 4 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
TAACTTTTAC AGAAGAGCAT CAGC 



(2) INFORMATION FOR SEQ ID NO : 1*7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 
AGCGCGGCCG CATGAACAAG TGGCTGTGCT GCG 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 



<xi) SEQUENCE DESCRIPTION: S£Q ID NO: 18 
AGCTCTAGAG AAACAGCCCA GTGACCATT^ C 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
GTGAAGCTGT GCAAGAACCT GATG 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 
ATCAAAGGCA GGGCATACTT CCTG 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 4 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
CAGATCCTGA AGCTGCTCAG TTTG * 
(2) INFORMATION FOR SEQ ID NO : 22 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 22: 
AGCGCGGCCG CGGGGACCAC AATGAACAAG TTG 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
AGCTCTAGAA TTGTGAGGAA ACAGCTCAAT GGC 33 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: $EQ ID NO:24: 

ATAGCGGCCG CTGAGCCCAA ATCTTGTGAC AAAACTCAC 39 

V 

(2) INFORMATION FOR SEQ ID.NO:25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid • 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

TCTAGAGTCG ACTTATCATT TACCCGGAGA CAGGGAGAGG CTCTT 45 
<2) INFORMATION FOR SEQ ID NO: 26: 

40 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
45 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

so 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 
CCTCTGAGCT CAAGCTTCCG AGGACCACAA TGAACAAG 

5 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 43 base pairs 
10 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 
CCTCTGCGGC CGCTAAGCAG CTTATTTTCA CGGATTGAAC CTG 
(2) INFORMATION FOR SEQ ID NO:28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 
CCTCTGAGCT CAAGCTTCCG AGGACCACAA TGAACAAG 
(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

t 

(ii) MOLECULE TYPE: cDNA 



so 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
TCCGTAAGAA ACAGCCCAGT GACC 24 
(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
CCTCTGCGGC CGCTGTTGCA TTTCCTTTCT G 31 
(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino .acids 

(B) TYPE: amino acicf 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

Glu Thr Leu Pro Pro Lys Tyr Leu His Tyr Asp Pro Glu Thr Gly His 
15 10 15 

40 

Gin Leu Leu 



(2) INFORMATION FOR SEQ ID NO: 32: 

45 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
50 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
TCCCTTGCCC TGACCACTCT T 
(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH :. 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 
CCTCTGCGGC CGCACACACG TTGTCATGT^ TTGC 
(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34 
TCCCTTGCCC TGACCACTCT T 
(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid- 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
CCTCTGCGGC CGCCTTTTGC GTGGCTTCTC TGTT 3 4 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
CCTCTGAGCT CAAGCTTGGT TTCCGGGGfcC CACAATG 37 
(2) INFORMATION FOR SEQ ID NO:^7: 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: CDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
CCTCTGCGGC CGCTAAGCAG CTTATTTTTA CTGAATGG 38 
(2) INFORMATION FOR SEQ ID NO:38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
CCTCTGAGCT CAAGCTTGGT TTCCGGGGAC CACAATG 
(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39 
CCTCTGCGGC CGCCAGGGTA ACATCTATT^ CAC 
(2) INFORMATION FOR SEQ ID NO:4P: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 
CCGAAGCTTC CACCATGAAC AAGTGGCTGT GCTGC 
(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41 
CCTCTGTCGA CTATTATAAG CAGCTTATTT TCACGGATTG 
(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42 

j 

TCCCTTGCCC TGACCACTCT T 
(2) INFORMATION FOR SEQ ID NO:^3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
CCTCTGTCGA CTTAACACAC GTTGTCATGT GTTGC 
(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 
TCCCTTGCCC TGACCACTCT T 
(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

i 

CCTCTGTCGA CTTACTTTTG CGTGGCTTCT CTGTT 

t 

(2) INFORMATION FOR SEQ ID NO : 4 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1537 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

GTGAAGAGCG TGAAGAGCGG TTCCTCCTTT CAGCAAAAAA CCCCTCAAGA CCCGTTTAGA 
GGCCCCAAGG GGTTATGCTA GTTATTGCTC AGCGGTGGCA GCAGCCAACT CAGCTTCCTT 
TCGGGCTTTC TTCTTCTTCT TCTTCTTTCC GCGGATCCTC GAGTAAGCTT CCATGGTACC 
CTGCAGGTCG ACACTAGTGA GCTCGAATTC C AACGCGT T A ACCATATGTT ATTCCTCCTT 
TAATTAGTTA AAACAAATCT AGAATCAAAT CGATTAATCG ACTATAACAA ACCATTTTCT 
TGCGTAAACC TGTACGATCC TACAGGTACT TATGTTAAAC AATTGTATTT CAAGCGATAT 
* AATAGTGTGA CAAAAATCCA ATTTATTAGA ATCAAATGTC AATCTATTAC CGTTTTAATG 
ATATATAACA CGCAAAACTT GCGACAAACA ATAGGTAAGG ATAAAGAGAT GGGTATGAAA 
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GACATAAATG 


CCGACGACAC 


rp rp ft /"» ft /~* 'ft *fn» 


Ax inAJ. AAAA 


TT A. AAfiPCTG 


TAG AAGC AAT 


540 




AATGATATTA 


ATCAATGCTT 


TV rp/^ ft rp j\ rp/"* 
ATCT*J A 1 A J. Vj 


AO 1 Annn 1 oLj 


TAPATTRTGA 


ATATTATTTA 

*V X a* X X *V XXX J* 


600 


5. 


CTCGCGATCA 


TTTATCCTCA 


TTCTATGGTT 


ft ft ft rp/-> rp/— ft rn ft 


HPT IT 1 A TiTTPT 


at rattap 


VJ VJ VJ 




CCTAAAAAAT 


GG AGGCAAT A 


TTATGATGAC 


GCTAATTTAA 


rn ft ft ft ft rp ft rp/-* ft 
TAAAATATVjA 


rp/""»/'"»rpftrpft/^rpft 

TLL I A I Avj r A 


/ Z U 


10 


GATTATTCTA 


ACTCCAATCA 


TTCACCGATT 


AATTGGAATA 


TATTTGAAAA 


CAATGCTGTA 


/ oO 




AATAAAAAAT 


CTCCAAATGT 


AATTAAAGAA 


GCGAAATCAT 


CAGGTCTTAT 


CACTGGGTTT 


Q >1 

8 40 


15 


AGTTTCCCTA 


TTCATACTGC 


TAATAATGGC 


TTCGGAATGC 


TTAGTTTTGC 


ft r"^ ft rp ry*/^ ft /"* ft 

ALA T TLAbAb 


u u 


AAAGACAACT 


ATATAGATAG 


TTTATTTTTA 


CATGCGTGTA 


m/* ft * 0 ft rp ft /*■» /*"' 

T G AAC AT ALL 


ft rp rp ft ftrprp/^rprp 

Al I AA1 loil 


q n 




CCTTCTCTAG 


TTGATAATTA 


TCGAAAAATA 


AATATAGCAA 


ATAATAAATC 


71 t\ ft i\ ft f**r^ ft rp 

AAAC AAC GAT 




20 


TTAACCAAAA 


GAGAAAAAGA 


ATGTTTAGCG 


TGGGCATGCG 


AAGGAAAAAG 


CTCTTGGGAT 


lOou 




ATTTCAAAAA 


TATTAGGCTG 


TAGTAAGCGC 
CCGCTGCCAA 


ACGGTCACTT 


TCCATTTAAC 


CAATGCGCAA 


1140 


25 


ATGAAACTCA 


ATACAACAAA 


AGTATTTCTA 


AAGCAATTTT 


AACAGGAGCA 


1 0 r» A 

1200 




ATTGATTGCC 


CATACTTTAA 


AAGTTAAGTA 


CGACGTCCAT 


ATTTGAATGT 


ATTTAGAAAA 


1^£ bO 




ATAAACAAAA 


GAGTTTGTAG 


AAACGCAAAA 


AGGCCATCCG 


TCAGGATGGC 


/-»rprp/-»rp/^/~«rprp ft 

CTTCTCjC XT A 


1 *? 


30 


ATTTGATGCC 


TGGCAGTTTA 


TGGCGGGCGT 


CCTGCCCCaCC 


ft rT*f~**T*f+cr*nn 
ACLL 1 CCvjov? 




1 JOU 




GCAACGTTCA 


AATCCGCTCC 


CGGCGGATTT 


GTCCTACTCA 


GGAGAGCGTT 


CACCGACAAA 


3:440 


35 


CAACAGATAA 


AACGAAAGGC 


CCAGTCTTTC 


GACTGAGCCT 


TTCGTTTTAT 


TTGATGCCTG 


1500 




GCAGTTCCCT 


ACTCTCGCAT 


GGGGAGACCA 


TGCATAC 






1537 




(2) INFORMATION FOR SEQ ID NO: 47 











40 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
45 (D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 
CCGGCGGACA TTTATCACAC AGCAGCTGAT GAGAAGTTTC TTCATCCA 
(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

20 

CGATTTGATT CTAGAAGGAG GAATAACATA TGGTTAACGC GTTGGAATTC GGTAC 55 
(2) INFORMATION FOR SEQ ID NO: 49: 

25 (i) SEQUENCE CHARACTERISTICS: 

(A> LENGTH: 49 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:49: 
CGAATTCCAA CGCGTTAACC ATATGTTATT CCTCCTTCTA GAATCAAAT 4 9 

(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1546 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

GCGTAACGTA TGCATGGTCT CCCCATGCGA GAGTAGGGAA CTGCCAGGCA TCAAATAAAA 60 

CGAAAGGCTC AGTCGAAAGA CTGGGCCTTT CGTTTTATCT GTTGTTTGTC GGTGAACGCT 120 

CTCCTGAGTA GGACAAATCC GCCGGGAGCG GATTTGAACG TTGCGAAGCA ACGGCCCGGA 180 

GGGTGGCGGG CAGGACGCCC GCCATAAACT GCCAGGCATC AAATTAAGCA GAAGGCCATC 2 40 

CTGACGGATG GCCTTTTTGC GTTTCTACAA ACTCTTTTGT TTATTTTTCT AAATACATTC 300 

AAATATGGAC GTCGTACTTA ACTTTTAAAG TATGGGCAAT CAATTGCTCC TGTTAAAATT 3 60 

GCTTTAGAAA TACTTTGGCA GCGGTTTGTT GTATTGAGTT TCATTTGCGC ATTGGTTAAA ^ 420 

TGGAAAGTGA CCGTGCGCTT ACTACAGCCT AATATTTTTG AAATATCCCA AGAGCTTTTT 4 80 

CCTTCGCATG CCCACGCTAA ACATTCTTTT TCTCTTTTGG TTAAATCGTT GTTTGATTTA 540 

TTATTTGCTA TATTTATTTT TCGATAATTA TCAACTAGAG AAGGAACAAT TAATGGTATG 600 

TTCATACACG CATGTAAAAA T AAACT ATCT ATATAGTTGT CTTTCTCTGA ATGTGCAAAA 660 



CTAAGCATTC CGAAGCCATT ATTAGCAGTA TGAATAGGGA AACTAAACCC AGTGATAAGA 120 

CCTGATGATT TCGCTTCTTT AATTACATTT GGAGATTTTT TATTTACAGC ATTGTTTTCA 7 80 

30 AATATATTCC AATTAATCGG TGAATGATTG GAGTTAGAAT AATCT AC T AT AGGATCATAT 8 40 

TTTATTAAAT TAGCGTCATC ATAATATTGC CTCCATTTTT TAGGGTAATT ATCCAGAATT 9.00 

GAAATATCAG ATTTAACCAT AGAATGAGGA TAAATGATCG CGAGTAAATA ATATTCACAA 960 

35 

TGTACCATTT TAGTCATATC AG AT AAGC AT TGATTAATAT CATTATTGCT TCTACAGGCT 1020 

TTAATTTTAT TAATTATTCT GTAAGTGTCG TCGGCATTTA TGTCTTTCAT ACCCATCTCT 1080 

40 TTATCCTTAC CTATTGTTTG TCGCAAGTTT TGCGTGTTAT ATATCATTAA AACGGTAATA 1140 

GAT TG AC AT T TGATTCTAAT AAATTGGATT TTTGTC ACAC TATTATATCG CTTGAAATAC 1200 

AATTGTTTAA CAT AAGT ACC TGTAGGATCG TACAGGTTTA CGCAAGAAAA TGGTTTGTTA 12 60 

TAGTCGATTA ATCGATTTGA TTCTAGATTT GTTTTAACTA ATTAAAGGAG GAATAACATA 1320 

TGGTTAACGC GTTGGAATTC GAGCTCACTA GTGTCGACCT GCAGGGTACC ATGGAAGCTT 1380 

ACTCGAGGAT CCGCGGAAAG AAGAAGAAGA AGAAGAAAGC CCGAAAGGAA GCTGAGTTGG 1440 

CTGCTGCCAC CGCTGAGCAA TAACTAGCAT AACCCCTTGG GGCCTCTAAA CGGGTCTTGA 1500 
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GGGGTTTTTT GCTGAAAGGA GGAACCGCTC TTCACGCTCT TCACGC 
(2) INFORMATION FOR SEQ ID NO:51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:51: 
TATGAAACAT CATCACCATC ACCATCATGC TAGCGTTAAC GCGTTGG 
(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:52: 
AATTCCAACG CGTTAACGCT AGCATGATGG TGATGGTGAT GATGTTTCA 



(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 141 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



<EP 0784093A1_I_> 



82 



EP 0 784 093 A1 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 
CTAATTCCGC TCTCACCTAC CAAACAATGC CCCCCTGCAA AAAATAAATT CATATAAAAA 
ACATACAGAT AACCATCTGC GGTGATAAAT TATCTCTGGC GGTGTTGACA TAAATACCAC 
TGGCGGTGAT ACTGAGCACA T 
(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 147 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 
CGATGTGCTC AGTATCACCG CCAGTGGTAT TTATGTCAAC ACCGCCAGAG ATAATTTATC 

V 

ACCGCAGATG GTTATCTGTA TGTTTTTTAT ATGAATTTAT TTTTTGCAGG GGGGCATTGT 
TTGGTAGGTG AGAGCGGAAT TAGACGT 
(2) INFORMATION FOR SEQ ID NO: 55: 

(i). SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 
CGATTTGATT CTAGAAGGAG GAATAACATA TGGTTAACGC GTTGGAATTC GGTAC 
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(2) INFORMATION FOR SEQ ID NO:56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: cDNA 



15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

CGAATTCCAA CGCGTTAACC ATATGTTATT CCTCCTTCTA GAATCAAAT 4 9 



20 



25 



30 



50 



(2) INFORMATION FOR SEQ ID NO : 57 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 668 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear* 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:57: 

35 GTGAAGAGCG TGAAGAGCGG TTCCTCCTTT CAGCAAAAAA CCCCTCAAGA CCCGTTTAGA ' 60 

GGCCCCAAGG GGTTATGCTA GTTATTGCTC AGCGGTGGCA GCAGCCAACT CAGCTTCCTT 120 

TCGGGCTTTC TTCTTCTTCT TCTTCTTTCC GCGGATCCTC GAGTAAGCTT CCATGGTACC 180 

40 

CTGCAGGTCG ACACTAGTGA GCTCGAATTC CAACGCGTTA ACCATATGTT ATTCCTCCTT 240 

TAATTAGTTA ACTCAAATCT AGAATCAAAT CGATAAATTG TGAGCGCTCA CAATTGAGAA 300 

45 TATTAATCAA GAATTTTAGC ATTTGTCAAA TGAATTTTTT AAAAATTATG AGACGTCCAT 3 60 

ATTTGAATGT ATTT AGAAAA ATAAACAAAA GAGTTTGTAG AAACGCAAAA AGGCCATCCG 420 

TCAGGATGGC CTTCTGCTTA ATTTGATGCC TGGCAGTTTA TGGCGGGCGT CCTGCCCGCC 480 

ACCCTCCGGG CCGTTGCTTC GCAACGTTCA AATCCGCTCC CGGCGGATTT GTCCTACTCA 540 

GGAGAGCGTT CACCGACAAA CAACAGATAA AACGAAAGGC CCAGTCTTTC GACTGAGCCT 600 



55 



84 



BNSDOCID: <EP 0784093 A1_l_> 



EP 0 784 093 A1 



TTCGTTTTAT TTGATGCCTG GCAGTTCCCT ACTCTCGCAT GGGGAGACCA TGCATACGTT 6 60 

ACGCACGT ,6 68 

5 

(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 726 base pairs 
10 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 
GCGTAACGTA TGCATGGTCT CCCCATGCGJ^ GAGTAGGGAA CTGCCAGGCA TCAAATAAAA 60 

CGAAAGGCTC AGTCGAAAGA CTGGGCCTT.T CGTTTTATCT GTTGTTTGTC GGTGAACGCT 120 

V 

CTCCTGAGTA GGACAAATCC GCCGGGAGC.G GATTTGAACG TTGCGAAGCA ACGGCCCGGA 180 

GGGTGGCGGG CAGGACGtCC GCCATAAACT GCCAGGCATC AAATTAAGCA GAAGGGGCCT 2 40. 

CCCACCGCCC GTCCTGCGGG CGGTATTTGA CGGTCCGT AG TTTAATTCGT CTTCGCCATC 300 

CTGACGGATG GCCTTTTTGC GTTTCTACAA ACTCTTTTGT TTATTTTTCT AAATACATTC 3 60 

AAATATGGAC GTCTCATAAT TTTTAAAAAA TTCATTTGAC AAATGCTAAA ATTCTTGATT 420 

AATATTCTCA ATTGTGAGCG CTCACAATTT ATCGATTTGA TTCTAGATTT GTTTTAACTA 480 

ATTAAAGGAG GAATAACATA TGGTTAACGC GTTGGAATTC GAGCTCACTA GTGTCGACCT 5 40 

GCAGGGTACC ATGGAAGCTT ACTCGAGGAT CCGCGGAAAG AAGAAGAAGA AGAAGAAAGC 600 

CCGAAAGGAA GCTGAGTTGG CTGCTGCCAC CGCTGAGCAA TAACTAGCAT AACCCCTTGG 6 60 

GGCCTCTAAA CGGGTCTTGA GGGGTTTTTT GCTGAAAGGA GGAACCGCTC TTCACGCTCT 720 

TCACGC 72 6 
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(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 4 4 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

10 (ii) MOLECULE TYPE: CDNA 



15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

m 

TACGCACTGG ATCCTTATAA GCAGCTTATT TTTACTGATT GGAC 44 
(2) INFORMATION FOR SEQ ID NO: 60: 

20 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic ac^ti 

(C) STRANDEDNESS: single 
25 (D) TOPOLOGY: linear*- 

(ii) MOLECULE TYPE: CDNA 

30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 
GTCCTCCTGG TACCTACCTA AAACAAC 27 
(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

45 (ii) MOLECULE TYPE: CDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 
50 TATGGATGAA GAAACTTCTC ATCAGCTGCT GTGTGATAAA TGTCCGCCGG GTACCCGGCG 60 
GACATTTATC ACACAGCAGC TGATGAGAAG TTTCTTCATC CA 102 

55 



35 



40 



86 



BNSDOCID: <EP 0784093A1_I_> 



EP 0 784 093 A1 



10 



15 



20 



40 



45 



SO 



55 



(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 

Met Asp Glu Glu Thr Ser His Gin Leu Leu Cys Asp Lys Cys Pro Pro 
1 5 10 15 

Gly Thr Tyr 



(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 
2S (A) LENGTH: 84 base gairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

30 (ii) MOLECULE TYPE: cDNA 



35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

TATGGAAACT TTTCCTCCAA AATATCTTCA TTATGATGAA GAAACTTCTC ATCAGCTGCT 60 
GTGTGATAAA TGTCCGCCGG GTAC 8 4 

(2) INFORMATION FOR SEQ ID NO: 64: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 
CCGGCGGACA TTTATCACAC AGCAGCTGAT GAGAAGTTTC TTCATCATAA TGAAGATATT 
TTGGAGGAAA AGTTTCCA 

(2) INFORMATION FOR SEQ ID NO: 65: 

<i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 4 4 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: JEQ ID NO: 65: 
TACGCACTGG ATCCTTATAA GCAGCTTATT TTCACGGATT GAAC 
(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 
GTGCTCCTGG TACCTACCTA AAACAGCACT GCACAGTG 
(2) INFORMATION FOR SEQ ID NO : 67 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 4 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 
TATGGAAACT CTGCCTCCAA AATACCTGCA TTACGATCCG GAAACTGGTC ATCAGCTGCT 60 
GTGTGATAAA TGTGCTCCGG GTAC 8 4 

(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 68: 
CCGGAGCACA TTTATCACAC AGCAGCTGAT GACCAGTTTC CGGATCGTAA TGCAGGTATT 60 
TTGGAGGCAG AGTTTCCA "7 8 

(2) INFORMATION FOR SEQ ID NO: 69: 

30 (i) SEQUENCE CHARACTERISTICS: - 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 

TATGGACCCA GAAACTGGTC ATCAGCTGCT GTGTGATAAA TGTGCTCCGG GTAC 54 
(2) INFORMATION FOR SEQ ID NO:70: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 
CCGGAGCACA TTTATCACAC AGCAGCTGAT GACCAGTTTC TGGGTCCA 
(2) INFORMATION FOR SEQ ID NO : 7 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

k 

(xi) SEQUENCE DESCRIPTION SEQ ID NO: 71: 
TATGAAAGAA ACTCTGCCTC CAAAATACCT GCATTACGAT CCGGAAACTG GTCATCAGCT 
GCTGTGTGAT AAATGTGCTC CGGGTAC 
(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 
CCGGAGCACA TTTATCACAC AGCAGCTGAT GACCAGTTTC CGGATCGTAA TGCAGGTATT 
TTGGAGGCAG AGTTTCTTTC A 
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(2) INFORMATION FOR SEQ ID NO:73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 71 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 
GTTCTCCTCA TATGAAACAT CATCACCATC ACCATCATGA AACTCTGCCT CCAAAATACC 
TGCATTACGA T 

(2) INFORMATION FOR SEQ ID NO: 7 4: 
(i) SEQUENCE CHARACTERISTICS: 



(B) TYPE: nucleic acid. 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:74: 
GTTCTCCTCA TATGAAAGAA ACTCTGCCTC CAAAATACCT GCA 
(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 76 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(A) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 
TACGCACTGG ATCCTTAATG ATGGTGATGG TGATGATGTA AGCAGCTTAT TTTCACGGAT 
TGAACCTGAT TCCCTA 

(2) INFORMATION FOR SEQ ID NO : 7 6 : 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: §EQ ID NO:76: 
GTTCTCCTCA TATGAAATAC CTGCATTACG ATCCGGAAAC TGGTCAT 
(2) INFORMATION FOR SEQ ID NO: 7.7: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:77: 
GTTCTCCTAT TAATGAAATA TCTTCATTAT GATGAAGAAA CTT 
(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : s ingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 
TACGCACTGG ATCCTTATAA GCAGCTTATT TTTACTGATT 
(2) INFORMATION FOR SEQ ID NO : 7 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SfcQ ID NO:79: 
GTTCTCCTCA TATGGAAACT CTGCCTCC^V AATACCTGCA 
(2) INFORMATION FOR SEQ ID NO:&0: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 43 base pairs 
<B) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 



35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 
TACGCACTGG ATCCTTATGT TGCATTTCCT TTCTGAATTA GCA 
(2) INFORMATION FOR SEQ ID NO: 81: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81 
CCGGAAACAG ATAATGAG 

(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



<xi) SEQUENCE DESCRIPTION: JEQ ID NO: 82 
GATCCTCATT ATCTGTTT 

if 

(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83 

CCGGAAACAG AGAAGCCACG CAAAAGTAAG 

(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84 
GATCCTTACT TTTGCGTGGC TTCTCTGTTT 
(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85 
TATGTTAATG AG 

(2) INFORMATION FOR SEQ ID NO : & : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86 
GATCCTCATT AACA 

(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 base pairs 
<B) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 
TATGTTCCGG AAACAGTTAA G 
(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 
GATCCTTAAC TGTTTCCGGA ACA V 
(2) INFORMATION FOR SEQ ID NO:*^: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89 
TATGTTCCGG AAACAGTGAA TCAACTCAAA AATAAG 
(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 
GATCCTTATT TTTGAGTTGA TTCACTGTTT CCGGAACA 38 
(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 100 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



20 ^ 

(xi) SEQUENCE DESCRIPTION: 5EQ ID NO: 91: 
CTAGCGACGA CGACGACAAA GAAACTCTGC CTCCAAAATA CCTGCATTAC GATCCGGAAA 60 

25 

CTGGTCATCA GCTGCTGTGT GATAAATGTG CTCCGGGTAC 100 
(2) INFORMATION FOR SEQ ID NO: 92: 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 92 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: CDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 
CCGGAGCACA TTTATCACAC AGCAGCTGAT GACCAGTTTC CGGATCGTAA TGCAGGTATT 60 
TTGGAGGCAG AGTTTCTTTG TCGTCGTCGT CG 92 
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(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 
ACAAACACAA TCGATTTGAT ACTAGA 
(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear. 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 
TTTGTTTTAA CTAATTAAAG GAGGAATAAA ATATGAGAGG ATCGCATCAC 
(2) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 
CATCACCATC ACGAAACCTT CCCGCCGAAA TACCTGCACT ACGACGAAGA 
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(2) INFORMATION FOR SEQ ID NO : 9 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 
AACCTCCCAC CAGCTGCTGT GCGACAAATG CCCGCCGGGT ACCCAAACA 
(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTIC^: 

(A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic aci$ . 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear * 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 
TGTTTGGGTA CCCGGCGGGC ATTTGT 
<2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 
CGCACAGCAG CTGGTGGGAG GTTTCTTCGT CGTAGTGCAG GTATTTCGGC 



99 

_0784093A1_I_> 



EP 0 784 093 A1 

(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 
GGGAAGGTTT CGTGATGGTG ATGGTGATGC GATCCTCTCA TATTTTATT 
(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear* 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 
CCTCCTTTAA TTAGTTAAAA CAAATCTAGT ATCAAATCGA TTGTGTTTGT 
(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l0l: 
* ACAAACACAA TCGATTTGAT ACTAGATTTG .TTTTAACTAA TTAAAGGAGG AATAAAATG 
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(2) INFORMATION FOR SEQ ID NO: 102: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 
CTAATTAAAG GAGGAATAAA AT G AAAG AAA CTTTTCCTCC AAAATATC 
(2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 



(D) TOPOLOGY: linear,- 
(ii) MOLECULE TYPE: CDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 

TGTTTGGGTA CCCGGCGGAC ATTTATCACA C 

<2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 5 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 104: 
ACAAACACAA TCGATTTGAT ACTAGATTTG TTTTAACTAA TTAAAGGAGG AATAAAATG 
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(2) INFORMATION FOR SEQ ID NO: 10 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 
CTAATTAAAG GAGGAATAAA ATGAAAAAAA AAGAAACTTT TCCTCCAAAA 
(2) INFORMATION FOR SEQ ID NO: 10 6: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic *c£d 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear.; 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 106: 
TGTTTGGGTA CCCGGCGGAC ATTTATCACA C 
(2) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:107: 
CAGCCCGGGT AAAATGGAAA CGTTTCCTCC AAAATATCTT CATT 
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(2) INFORMATION FOR SEQ ID NO: 108: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 4 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:108: 

CGTTTCCATT TTACCCGGGC TGAGCGAGAG GCTCTTCTGC GTGT 

(2) INFORMATION FOR SEQ ID NO: 109: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear^ 

(ii) MOLECULE TYPE: cDNA > 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 
CGCTCAGCCC GGGTAAAATG GAAACGTTGC CTCCAAAATA CCTGC 
(2) INFORMATION FOR SEQ ID NO:110: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 
CCATTTTACC CGGGCTGAGC GAGAGGCTCT TCTGCGTGT 
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(2) INFORMATION FOR SEQ ID NO: 111: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 36 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 
GAAAATAAGC TGCTTAGCTG CAGCTGAACC AAAATC 
(2) INFORMATION FOR SEQ ID NO: 112: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic ac^d 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear-; 

(ii) MOLECULE TYPE: cDNA ^ 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 
CAGCTGCAGC TAAGCAGCTT ATTTTCACGG ATTG 
(2) INFORMATION FOR SEQ ID NO: 113: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:113 
AAAAATAAGC TGCTTAGCTG CAGCTGAACC AAAATC 
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(2) INFORMATION FOR SEQ ID NO: 114: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114: 
CAGCTGCAGC TAAGCAGCTT ATTTTTACTG ATTGG 
(2) INFORMATION FOR SEQ ID NO: 115: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 base fjairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

V 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 
CTAGAAGGAG GAATAACATA TGGAAACTTT TGCTCCAAAA TATCTTCATT ATGATGAAGA 
AACTAGTCAT CAGCTGCTGT GTGATAAATG TCCGCCGGGT AC 
(2) INFORMATION FOR SEQ ID NO: 116: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116: 
CCGGCGGACA TTTATCACAC AGCAGCTGAT GACTAGTTTC TTCATCATAA TGAAGATATT 

5 

TTGGAGCAAA AGTTTCCATA TGTTATTCCT CCTT 
(2) INFORMATION FOR SEQ ID NO: 117: 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

15 

(ii) MOLECULE TYPE: CDNA 



20 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:117: 
CTAGAAGGAG GAATAACATA TGGAAACT^T TCCTGCTAAA TATCTTCATT ATGATGAAGA 6 0 



62 



25 AA > 

(2) INFORMATION FOR SEQ ID NO: 118: 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 62 base pairs 

<B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

35 (ii) MOLECULE TYPE: CDNA 



40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:118: 

CTAGTTTCTT CATCATAATG AAGATATTTA GCAGGAAAAG TTTCCATATG TTATTCCTCC 60 

62 



TT 

(2) INFORMATION FOR SEQ ID NO: 119 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 amino acids 
50 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



55 



106 



BNSDOCID: <EP 0784O93A1_l_> 



EP 0 784 093 A1 



10 



is 



20 



25 



30 



35 



40 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119: 

Tyr His Tyr Tyr Asp Gin Asn Gly Arg Met Cys Glu Glu Cys His Met 
15 10 15 

Cys Gin Pro Gly His Phe Leu Val Lys His Cys Lys Gin Pro Lys Arg 
20 25 30 

Asp Thr Val Cys His Lys Pro Cys Glu Pro Gly Val Thr Tyr Thr Asp 
35 40 45 

Asp Trp His 
50 

(2) INFORMATION FOR SEQ ID NO: 120: 

(i) SEQUENCE CHARACTERISTIC^ : 

(A) LENGTH: 2 432 base pairs 

(B) TYPE: nucleic acy 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear-" 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 124.. 1326 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120: 
ATCAAAGGCA GGGCATACTT CCTGTTGCCC AGACCTTATA TAAAACGTCA TGTTCGCCTG 60 
GGCAGCAGAG AAGCACCTAG CACTGGCCCA GCGGCTGCCG CCTGAGGTTT CCAGAGGACC 120 



ACA ATG AAC AAG TGG CTG TGC TGT GCA CTC CTG GTG TTC TTG GAC ATC 168 

Met Asn Lys Trp Leu Cys Cys Ala Leu Leu Val Phe Leu Asp lie 
45 \ 5 10 15 

ATT GAA TGG ACA ACC CAG GAA ACC TTT CCT CCA AAA TAC TTG CAT TAT - 216 

lie Glu Trp Thr Thr Gin Glu Thr Phe Pro Pro Lys Tyr Leu His Tyr 
20 25 30 

so 
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AGA TGT CCG GAT GGG TTC TTC TCA GGT GAG ACG TCA TCG AAA GCA CCC 
Arg Cys Pro Asp Gly Phe Phe Ser Gly Glu Thr Ser Ser Lys Ala Pro 
145 150 155 



GAC CCA GAA ACC GGA CGT CAG CTC TTG TGT G AC AAA TGT GCT CCT GGC 2 64 

Asp Pro Glu Thr Gly Arg Gin Leu Leu Cys Asp Lys Cys Ala Pro Gly 

35 40 45 

5 

ACC TAC CTA AAA CAG CAC TGC ACA GTC AGG AGG AAG ACA CTG TGT GTC 312 

Thr Tyr Leu Lys Gin His Cys Thr Val Arg Arg Lys Thr Leu Cys Val 

50 55 60 

10 CCT TGC CCT GAC TAC TCT TAT ACA GAC AGC TGG CAC ACG AGT GAT GAA 360 

Pro Cys Pro Asp Tyr Ser Tyr Thr Asp Ser Trp His Thr Ser Asp Glu 
65 70 75 

TGC GTG TAC TGC AGC CCC GTG TGC AAG GAA CTG CAG ACC GTG AAA CAG 408 
Cys Val Tyr Cys Ser Pro Val Cys Lys Glu Leu Gin Thr Val Lys Gin 
80 J 85 90 95 

GAG TGC AAC CGC ACC CAC AAC CGA GTG TGC GAA TGT GAG GAA GGG CGC 45 6 

Glu Cys Asn Arg Thr His Asn Arg Val Cys Glu Cys Glu Glu Gly Arg 
100 * .105 110 

4» 

TAC CTG GAG CTC GAA TTC TGC TTG AAG CAC CGG AGC TGT CCC CCA GGC 50 4 

Tyr Leu Glu Leu Glu Phe Cys Leu Lys His Arg Ser Cys Pro Pro Gly 
115 *" 120 125 

TTG GGT GTG CTG CAG GCT GGG ACC CCA GAG CGA AAC ACG GTT TGC AAA 552 
Leu Gly Val Leu Gin Ala Gly Thr Pro Glu Arg Asn Thr Val Cys Lys 
130 135 140 



600 



TGT AGG AAA CAC ACC AAC TGC AGC TCA CTT GGC CTC CTG CTA ATT CAG 648 
Cys Arg Lys His Thr Asn Cys Ser Ser Leu Gly Leu Leu Leu lie Gin 
160 165 170 175 

AAA GGA AAT GCA ACA CAT GAC AAT GTA TGT TCC GGA AAC AGA GAA GCA 696 
Lys Gly Asn Ala Thr His Asp Asn Val Cys Ser Gly Asn Arg Glu Ala 
180 185 190 



ACT CAA AAT TGT GGA ATA GAT GTC ACC CTG TGC GAA GAG GCA TTC TTC 744 
Thr Gin Asn Cys Gly lie Asp Val Thr Leu Cys Glu Glu Ala Phe Phe 
45 195 200 205 

AGG TTT GCT GTG CCT ACC AAG ATT ATA CCG AAT TGG CTG AGT GTT CTG 7 92 

Arg Phe Ala Val Pro Thr Lys lie lie Pro Asn Trp Leu Ser Val Leu 
210 215 220 

GTG GAC AGT TTG CCT GGG ACC AAA GTG AAT GCA GAG AGT GTA GAG AGG 840 
Val Asp Ser Leu Pro Gly Thr Lys Val Asn Ala Glu Ser Val Glu Arg 
225 230 235 
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ATA AAA CGG AGA CAC AGC TCG CAA GAG CAA ACT TTC CAG CTA CTT AAG 88 8 

lie Ly3 Arg Arg His Ser Ser Gin Glu Gin Thr Phe Gin Leu Leu Lys 
240 245 250 255 

CTG TGG AAG CAT CAA AAC AGA GAC CAG GAA ATG GTG AAG AAG ATC ATC 93 6 

Leu Trp Lys His Gin Asn Arg Asp Gin Glu Met Val Lys Lys lie lie 

260 265 270 

CAA GAC ATT GAC CTC TGT GAA AGC AGT GTG CAA CGG CAT ATC GGC CAC 98 4 

Gin Asp lie Asp Leu Cys Glu Ser Ser Val Gin Arg His lie Gly His 
275 280 285 

GCG AAC CTC ACC ACA GAG CAG CTC CGC ATC TTG ATG GAG AGC TTG CCT 1032 

Ala Asn Leu Thr Thr Glu Gin Leu Arg lie Leu Met Glu Ser Leu Pro 
290 295 300 

GGG AAG AAG ATC AGC CCA GAC GAG ATT GAG AGA ACG AGA AAG ACC TGC 10 8 0 

Gly Lys Lys lie Ser Pro Asp Glu lie Glu Arg Thr Arg Lys Thr Cys 
305 310 315 

AAA CCC AGC GAG CAG CTC CTG AAG CTA CTG AGC TTG TGG AGG ATC AAA 1128 

Lys Pro Ser Glu Gin Leu Leu Lys- Leu Leu Ser Leu Trp Arg lie Lys 
320 325 330 335 

AAT GGA GAC CAA GAC ACC . TTG AAG GGC CTG ATG TAC GCA CTC AAG CAC 117 6 

Asn Gly Asp Gin Asp Thr Leu Lys Gly Leu Met Tyr Ala Leu Lys His 

340 345 350 

TTG AAA GCA TAC CAC TTT CCC AAA ACC GTC ACC CAC AGT CTG AGG AAG 122 4 

Leu Lys Ala Tyr His Phe Pro Lys Thr Val Thr His Ser Leu Arg Lys 
355 360 365 

35 ACC ATC AGG TTC TTG CAC AGC TTC ACC ATG TAC CGA TTG TAT CAG AAA 127 2 

Thr lie Arg Phe Leu His Ser Phe Thr Met Tyr Arg Leu Tyr Gin Lys 
370 375 380 
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CTC TTT CTA GAA ATG ATA GGG AAT CAG GTT CAA TCA GTG AAG ATA AGC 1320 
Leu Phe Leu Glu Met lie Gly Asn Gin Val Gin Ser Val Lys lie Ser 
385 390 395 

TGC TTA TAGTTAGGAA TGGTCACTGG GCTGTTTCTT CAGGATGGGC CAACACTGAT 137 6 

Cys Leu 

400 

GGAGCAGATG GCTGCTTCTC CGGCTCTTGA AATGGCAGTT GATTCCTTTC TCATCAGTTG 1436 

GTGGGAATGA AGATCCTCCA GCCCAACACA CACACTGGGG AGTCTGAGTC AGGAGAGTGA 149 6 

GGCAGGCTAT TTGATAATTG TGCAAAGCTG CCAGGTGTAC ACCTAGAAAG TCAAGCACCC 1556 
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TGAGAAAGAG GATATTTTTA TAACCTCAAA CATAGGCCCT TTCCTTCCTC TCCTTATGGA 1616 

TGAGTACTCA GAAGGCTTCT ACTATCTTCT GTGTCATCCC TAGATGAAGG CCTCTTTTAT 167 6 

TTATTTTTTT ATTCTTTTTT TCGGAGCTGG GGACCGAACC CAGGGCCTTG CGCTTGCGAG 1736 

GCAAGTGCTC TACCACTGAG CTAAATCTCC AACCCCTGAA GGCCTCTTTC TTTCTGCCTC 17 96 

TGATAGTCTA TGACATTCTT TTTTCTACAA TTCGTATCAG GTGCACGAGC CTTATCCCAT 1856 

TTGTAGGTTT CTAGGCAAGT TGACCGTTAG CTATTTTTCC CTCTGAAGAT TTGATTCGAG 1916 

TTGCAGACTT GGCTAGACAA GCAGGGGTAG GTTATGGTAG TTTATTTAAC AGACTGCCAC 197 6 

CAGGAGTCCA GTGTTTCTTG TTCCTCTGTA GTTGTACCTA AGCTGACTCC AAGTACATTT 2 036 

AGTATGAAAA ATAATCAACA AATTTTATTC CTTCTATCAA CATTGGCTAG CTTTGTTTCA 2 096 

20 GGGCACTAAA AGAAACTACT ATATGGAGAA AGAATTGATA TTGCCCCCAA CGTTCAACAA 2156 

J 

CCCAATAGTT TATCCAGCTG TCATGCCTGG TTCAGTGTCT ACTGACTATG CGCCCTCTTA 2216 

TTACTGCATG CAGTAATTCA ACTGGAAATA GTAATAATAA TAATAGAAAT AAAATCTAGA 227 6 



10 



15 



25 



35 



40 



45 



v 



CTCCATTGGA TCTCTCTGAA TATGGGAATA TCTAACTTAA GAAGCTTTGA GATTTCAGTT 2336 
GTGTTAAAGG CTTTTATTAA AAAGCTGATG CTCTTCTGTA AAAGTTACTA ATATATCTGT 2 396 

AAGACTATTA CAGTATTGCT ATTTATATCC ATCCAG 2432 



(2) INFORMATION FOR SEQ ID NO: 121: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 401 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121: 

Met Asn Lys Trp Leu Cys Cys Ala Leu Leu Val Phe Leu Asp He He 
so x 5 10 15 

Glu Trp Thr Thr Gin Glu Thr Phe Pro Pro Lys Tyr Leu His Tyr Asp 
20 25 30 
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Pro Glu Thr Gly . Arg Gin Leu Leu Cys Asp Lys Cys Ala Pro Gly Thr 
35 40 45 

5 

Tyr Leu Lys Gin His Cys Thr Val Arg Arg Lys Thr Leu Cys Val Pro 
50 55 60 

Cys Pro Asp Tyr Ser Tyr Thr Asp Ser Trp His Thr Ser Asp Glu Cys 
10 65 70 75 80 

Val Tyr Cys Ser Pro Val Cys Lys Glu Leu Gin Thr Val Lys Gin Glu 
85 90 95 

75 Cys Asn Arg Thr His Asn Arg Val Cys Glu Cys Glu Glu Gly Arg Tyr 

100 105 110 

Leu Glu Leu Glu Phe Cys Leu Lys His Arg Ser Cys Pro Pro Gly Leu 
115 120 125 



20 



25 



30 



35 



40 



45 



50 



55 



Gly Val Leu Gin Ala Gly Thr Pro Glu Arg Asn Thr Val Cys Lys Arg 
130 135 J 140 

Cys Pro Asp Gly Phe Phe Ser Gl$f Glu Thr Ser Ser Lys Ala Pro Cys 
145 150 . 155 160 

Arg Lys His Thr Asn Cys Ser Ser Leu Gly Leu Leu Leu lie Gin Lys 
165 170 175 

Gly Asn Ala Thr His Asp Asn Val Cys Ser Gly Asn Arg Glu Ala Thr 
180 185 190 

Gin Asn Cys Gly lie Asp Val Thr Leu Cys Glu Glu Ala Phe Phe Arg 
195 200 205 

Phe Ala Val Pro Thr Lys lie He Pro Asn Trp Leu Ser Val Leu Val 
210 215 220 

Asp Ser Leu Pro Gly Thr Lys Val Asn Ala Glu Ser Val Glu Arg He 
225 230 235 240 

Lys Arg Arg His Ser Ser Gin Glu Gin Thr Phe Gin Leu Leu Lys Leu 
245 250 255 

Trp Lys His Gin Asn Arg Asp Gin Glu Met Val Lys Lys He lie Gin 
260 265 270 

Asp lie Asp Leu Cys Glu Ser Ser Val Gin Arg His lie Gly His Ala 
275 280 285 

Asn Leu Thr Thr Glu Gin Leu Arg lie Leu Met Glu Ser Leu Pro Gly 
290 295 300 
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Lys Lys He Ser Pro Asp Glu He Glu Arg Thr Arg Lys Thr Cys Lys 
305 310 315 

Pro Ser Glu Gin Leu Leu Lys Leu Leu Ser Leu Trp Arg He Lys Asn 
325 330 335 

Gly Asp Gin Asp Thr Leu Lys Gly Leu Met Tyr Ala Leu Lys His Leu 
10 340 345 350 

Lys Ala Tyr His Phe Pro Lys Thr Val Thr His Ser Leu Arg Lys Thr 
355 360 365 



15 



20 



25 



30 



35 



40 



45 



lie Arg Phe Leu His Ser Phe Thr Met Tyr Arg Leu Tyr Gin Lys Leu 
370 375 380 



Phe Leu Glu Met He Gly Asn Gin Val Gin Ser Val Lys He Ser Cys 
385 

Leu 



390 395 400 



(2) INFORMATION FOR SEQ ID NO: 122: 

{ i) SEQUENCE * CHARACTERISTICS : 

(A) LENGTH: 1324 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 90.. 12 92 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122: 
CCTTATATAA ACGTCATGAT TGCCTGGGCT GCAGAGACGC ACCTAGCACT GACCCAGCGG 

CTGCCTCCTG AGGTTTCCCG AGGACCACA ATG AAC AAG TGG CTG TGC TGC GCA 

Met Asn Lys Trp Leu Cys Cys Ala 
1 5 



50 CTC CTG GTG CTC CTG GAC ATC ATT GAA TGG ACA ACC CAG GAA ACC CTT 

' Lu Leu val Leu Leu Asp He He Glu Trp Thr Thr. Gin Glu Thr Leu 
10 15 20 
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CCT CCA AAG TAC TTG CAT TAT GAC CCA GAA ACT GGT CAT CAG CTC CTG 
Pro Pro Lys Tyr Leu His Tyr Asp Pro Glu Thr Gly His Gin Leu Leu 
25 30 35 40 

TGT GAC AAA TGT GCT CCT GGC ACC TAC CTA AAA CAG CAC TGC ACA GTG 
Cys Asp Lys Cys Ala Pro Gly Thr Tyr Leu Lys Gin His Cys Thr Val 
45 50 55 

AGG AGG AAG ACA TTG TGT GTC CCT TGC CCT GAC CAC TCT TAT ACG GAC 
Arg Arg Lys Thr Leu Cys Val Pro Cys Pro Asp His Ser Tyr Thr Asp 
60 65 70 

AGC TGG CAC ACC AGT GAT GAG TGT GTG TAT TGC AGC CCA GTG TGC - AAG 
Ser Trp His Thr Ser Asp Glu Cys Val Tyr Cys Ser Pro Val Cys Lys 
75 80 85 

GAA CTG CAG TCC GTG AAG CAG GAG TGC AAC CGC ACC CAC AAC CGA GTG 
Glu Leu Gin Ser Val Lys Gin Glu Cys Asn Arg Thr His Asn Arg Val 
90 95 100 

TGT GAG TGT GAG GAA GGG CGT TAC £TG GAG ATC GAA TTC TGC TTG AAG 
Cys Glu Cys Glu Glu Gly Arg Tyr Leu Glu He Glu Phe Cys Leu Lys 
105 HO ■ 115 120 

CAC CGG AGC TGT CCC CCG GGC TCC. -GGC GTG GTG CAA GCT GGA ACC CCA 
His Arg Ser Cys Pro Pro Gly Ser v Gly Val Val Gin Ala Gly Thr Pro 
125 130 135 

GAG CGA AAC ACA GTT TGC AAA AAA TGT CCA GAT GGG TTC TTC TCA GGT 
Glu Arg Asn Thr Val Cys Lys Lys Cys Pro Asp Gly Phe Phe Ser Gly 
140 145 150 

GAG ACT TCA TCG AAA GCA CCC TGT ATA AAA CAC ACG AAC TGC AGC ACA 
Glu Thr Ser Ser Lys Ala Pro Cys ile Lys His Thr Asn Cys Ser Thr 
155 160 165 

TTT GGC CTC CTG CTA ATT CAG AAA GGA AAT GCA ACA CAT GAC AAC GTG 
Phe Gly Leu Leu Leu Ile Gin Lys Gly Asn Ala Thr His Asp Asn Val 
170 175 180 

TGT TCC GGA AAC AGA GAA GCC ACG CAA AAG TGT . GGA ATA GAT GTC ACC 
Cys Ser Gly Asn Arg Glu Ala Thr Gin Lys Cys Gly Ile Asp Val Thr 
185 190 195 200 

CTG TGT GAA GAG GCC TTC TTC AGG TTT GCT GTT CCT ACC AAG ATT ATA 
Leu Cys Glu Glu Ala Phe Phe Arg Phe Ala Val Pro Thr Lys Ile Ile 
205 210 215 
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10 



15 



25 



CCA AAT TGG CTG AGT GTT TTG GTG GAC AGT TTG CCT GGG ACC AAA GTG "7 85 

Pro Asn Trp Leu Ser val Leu Val Asp Ser Leu Pro Gly Thr Lys Val 
220 225 230 

AAT GCC GAG AGT GTA GAG AGG ATA AAA CGG AGA CAC AGC TCA CAA GAG 833 
Asn Ala Glu Ser Val Glu Arg He Lys Arg Arg His Ser Ser Gin Glu 
235 240 245 

CAA ACC TTC CAG CTG CTG AAG CTG TGG AAA CAT CAA AAC AGA GAC CAG 881 
Gin Thr Phe Gin Leu Leu Lys Leu Trp Lys His Gin Asn Arg Asp Gin 
250 255 260 

GAA ATG GTG AAG AAG ATC ATC CAA GAC ATT GAC CTC TGT GAA AGC AGC 92 9 

Glu Met Val Lys Lys He He Gin Asp He Asp Leu Cys Glu Ser Ser 
265 270 275 280 

GTG CAG CGG CAT CTC GGC CAC TCG AAC CTC ACC ACA GAG CAG CTT CTT 97 7 

Val Gin Arg His Leu Gly His Ser Asn Leu Thr Thr Glu Gin Leu Leu 
20 285 290 295 

GCC TTG ATG GAG AGC CTG CCT GGG^AAG AAG ATC AGC CCA GAA GAG ATT 1025 
Ala Leu Met Glu Ser Leu Pro Gly Lys Lys He Ser Pro Glu Glu He 
300 • 305 310 

k 

GAG AGA ACG AGA AAG ACC TGC AAA TCG AGC GAG CAG CTC CTG AAG CTA 107 3 

Glu Arg Thr Arg Lys Thr Cys Lyl Ser Ser Glu Gin Leu Leu Lys Leu 
315 320 325 

30 CTC AGT TTA TGG AGG ATC AAA AAT GGT GAC CAA GAC ACC TTG AAG GGC 1121 

Leu Ser Leu Trp Arg He Lys Asn Gly Asp Gin Asp Thr Leu Lys Gly 
330 335 340 

CTG ATG TAT GCC CTC AAG CAC TTG AAA ACA TCC CAC TTT CCC AAA ACT 1169 
35 Leu Met Tyr Ala Leu Lys His Leu Lys Thr Ser His Phe Pro Lys Thr 

345 350 355 360 

GTC ACC CAC AGT CTG AGG AAG ACC ATG AGG TTC CTG CAC AGC TTC ACA 1217 
Val Thr His Ser Leu Arg Lys Thr Met Arg Phe Leu His Ser Phe Thr 
40 365 370 375 

ATG TAC AGA CTG TAT CAG AAG CTC TTT TTA GAA ATG ATA GGG AAT CAG 12 65 

Met Tyr Arg Leu Tyr Gin Lys Leu Phe Leu Glu Met He Gly Asn Gin 
380 385 390 

45 

GTT CAA TCC GTG AAA ATA AGC TGC TTA T AACT AGG AA TGGTCACTGG 1312 
Val Gin Ser Val Lys He Ser Cys Leu 
395 400 

50 GCTGTTTCTT CA 132 4 
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(2) INFORMATION FOR SEQ ID NO: 123: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 401 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123: 

Met Asn Lys Trp Leu Cys Cy3 Ala Leu Leu Val Leu Leu Asp lie lie 
15 10 15 

Glu Trp Thr Thr Gin Glu Thr Leu Pro Pro Lys Tyr Leu His Tyr Asp 
20 2 5 30 

Pro Glu Thr Gly His Gin Leu Leu Cys Asp Lys Cys Ala Pro Gly Thr 
35 " 40, 45 

2S Tyr Leu Lys Gin His Cys Thr VaL Arg Arg Lys Thr Leu Cys Val Pro 

50 55 * 60 

Cys Pro Asp His Ser Tyr Thr Asp Ser Trp His Thr Ser Asp Glu Cys 
65 70 75 80 

30 

Val Tyr Cys Ser Pro Val Cys Lys Glu Leu Gin Ser Val Lys Gin Glu 
85 90 95 

Cys Asn Arg Thr His Asn Arg Val Cys Glu Cys Glu Glu Gly Arg Tyr 
100 105 110 

Leu Glu lie Glu Phe Cys Leu Lys His Arg Ser Cys Pro Pro Gly Ser 
115 120 125 

40 Gly Val Val Gin Ala Gly Thr Pro Glu Arg Asn Thr Val Cys Lys Lys 

130 135 140 

Cys Pro Asp Gly Phe Phe Ser Gly Glu Thr Ser Ser Lys Ala Pro Cys 
145 150 155 160 

45 

lie Lys His Thr Asn Cys Ser Thr Phe Gly Leu Leu Leu lie Gin Lys 
165 170 175 

Gly Asn Ala Thr His Asp Asn Val Cys Ser Gly Asn Arg Glu Ala Thr 
50 180 185 190 

--Gin Lys Cys Gly He Asp Val Thr Leu Cys Glu Glu Ala Phe Phe Arg 
195 200 205 

55 
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Phe Ala val Pro Thr Lys He He Pro Asn Trp Leu Ser Val Leu Val 
210 215 220 

Asp Ser Leu Pro Gly Thr Lys Val Asn Ala Glu Ser Val Glu Arg lie 
225 230 235 240 

Lys Arg Arg His Ser Ser Gin Glu Gin Thr Phe Gin Leu Leu Lys Leu 
245 250 255 

Trp Lys His Gin Asn Arg Asp Gin Glu Met Val Lys Lys He He Gin 
260 265 270 

Asd He Asp Leu Cys Glu Ser Ser Val Gin Arg His Leu Gly His Ser 
275 280 285 

Asn Leu Thr Thr Glu Gin Leu Leu Ala Leu Met Glu Ser Leu Pro Gly 
290 295 300 

Lys Lys He Ser Pro Glu Glu He Glu Arg Thr Arg Lys Thr Cys Lys 
305 310 j 315 320 

Ser Ser Glu Gin Leu Leu Lys Leu Leu Ser Leu Trp Arg He Lys Asn 
325 v 330 335 

Gly Asp Gin Asp Thr Leu Lys Gly Leu Met Tyr Ala Leu Lys His Leu 
340 345 350 

Lys Thr Ser His Phe Pro Lys Thr Val Thr His Ser Leu Arg Lys Thr 
355 360 365 

Met Arg Phe Leu His Ser Phe Thr Met Tyr Arg Leu Tyr Gin Lys Leu 
370 375 380 

Phe Leu Glu Met He Gly Asn Gin Val Gin Ser Val Lys He Ser Cys 
385 390 395 400 

Leu 



(2) INFORMATION FOR SEQ ID. NO: 12 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1355 base pairs 
CB) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 94.. 1296 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 4: 

GTATATATAA CGTGATGAGC GTACGGGTGC GGAGACGCAC CGGAGCGCTC GCCCAGCCGC 

CGCTCCAAGC CCCTGAGGTT TCCGGGGACC ACA ATG AAC AAG TTG CTG TGC TGC 

Met Asn Lys Leu Leu Cys Cys 
1 5 

GCG CTC GTG TTT CTG GAC ATC TCC ATT AAG TGG ACC ACC. CAG GAA ACG 
Ala Leu Val Phe Leu Asp He Ser He Lys Trp Thr Thr Gin Glu Thr 
10 15 20 

TTT CCT CCA AAG TAC CTT CAT TAT GAC GAA GAA ACC TCT CAT CAG CTG 
Phe Pro Pro Lys Tyr Leu His Tyr^Asp Glu Glu Thr Ser His Gin Leu 
25 " ~ 30 35 

TTG TGT GAC AAA TGT CCT CCT Gdt ACC . TAC CTA AAA CAA CAC TGT ACA 
Leu Cys Asp Lys Cys Pro Pro Gly Thr Tyr Leu Lys Gin His Cys Thr 
40 45 . £ 50 55 

GCA AAG TGG AAG ACC GTG TGC GCC CCT TGC CCT GAC CAC TAC TAC ACA 
Ala Lys Trp Lys Thr Val Cys Ala Pro Cys Pro Asp His Tyr Tyr Thr 
60 65 7 0 

GAC AGC TGG CAC ACC AGT GAC GAG TGT CTA TAC TGC AGC CCC GTG TGC 
Asp Ser Trp His Thr Ser Asp Glu Cys Leu Tyr Cys Ser Pro Val Cys 
75 80 8 5 

AAG GAG CTG CAG TAC GTC AAG CAG GAG TGC AAT CGC ACC CAC AAC CGC 
Lys Glu Leu Gin Tyr Val Lys Gin Glu Cys Asn Arg Thr His Asn Arg 
90 95 100 

GTG TGC GAA TGC AAG GAA GGG CGC TAC CTT GAG ATA GAG . TTC TGC TTG 
Val Cys Glu Cys Lys Glu Gly Arg Tyr Leu Glu He Glu Phe Cys Leu 
105 HO 115 

AAA CAT AGG AGC TGC CCT CCT GGA TTT GGA GTG GTG CAA GCT GGA ACC 
Lys His Arg Ser Cys Pro Pro Gly Phe Gly val Val Gin Ala Gly Thr 
120 125 130 135 

CCA GAG CGA AAT ACA GTT TGC AAA AGA TGT CCA GAT GGG TTC TTC TCA 
Pro Glu Arg Asn Thr val Cys Lys Arg Cys Pro Asp Gly Phe Phe Ser 
140 145 150 
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70 



1S 



20 



2S 



30 



3S 



40 



SO 



AAT GAG ACG TCA TCT AAA GCA CCC TGT AGA AAA CAC ACA AAT TGC AGT 59 4 

Asn Glu Thr Ser Ser Lys. Ala Pro Cys Arg Lys His Thr Asn Cys Ser 
155 160 165 

GTC TTT GGT CTC CTG CTA ACT CAG AAA GGA AAT GCA ACA CAC GAC AAC 6 42 

Val Phe Gly Leu Leu Leu Thr Gin Lys Gly Asn Ala Thr His Asp Asn 
170 175 180 



ATA TGT TCC GGA AAC AGT GAA TCA ACT CAA AAA TGT GGA ATA GAT GTT 690 
He Cys Ser Gly Asn Ser Glu Ser Thr Gin Lys Cys Gly He Asp Val 
185 190 195 

ACC CTG TGT GAG GAG GCA TTC TTC AGG TTT GCT GTT CCT ACA AAG TTT 7 38 

Thr Leu Cys Glu Glu Ala Phe Phe Arg Phe Ala Val Pro Thr Lys Phe „ 
200 ' 205 210 215 

ACG CCT AAC TGG CTT AGT GTC TTG GTA GAC AAT TTG CCT GGC ACC AAA 7 86 

Thr Pro Asn Trp Leu Ser Val Leu .Val Asp Asn Leu Pro Gly Thr Lys 
220 225 230 

GTA AAC GCA GAG AGT GTA GAG AG<i ATA AAA CGG CAA CAC AGC TCA CAA 834 
Val Asn Ala Glu Ser Val Glu Arg. He Lys Arg Gin His Ser Ser Gin 
235 ' ; 240 245 

GAA CAG ACT TTC CAG CTG CTG AAG TTA TGG AAA CAT CAA AAC AAA GCC 882 
Glu Gin Thr Phe Gin Leu Leu Lys Leu Trp Lys His Gin Asn Lys Ala 
250 255 260 

CAA GAT ATA GTC AAG AAG ATC ATC CAA GAT ATT GAC CTC TGT GAA AAC 930 
Gin Asp lie Val Lys Lys He He Gin Asp lie Asp Leu Cys Glu Asn 
265 270 275 

AGC GTG CAG CGG CAC ATT GGA CAT GCT AAC CTC ACC TTC GAG CAG CTT 978 
Ser Val Gin Arg His lie Gly His Ala Asn Leu Thr Phe Glu Gin Leu 
280 " 285 290 295 

CGT AGC TTG ATG GAA AGC TTA CCG GGA AAG AAA GTG GGA GCA GAA GAC 102 6 

Arg Ser Leu Met Glu Ser Leu Pro Gly Lys Lys Val Gly Ala Glu Asp 
300 305 310 

45 ATT GAA AAA ACA ATA AAG GCA TGC AAA CCC AGT GAC CAG ATC CTG AAG 107 4 

He Glu Lys Thr He Lys Ala Cys Lys Pro Ser Asp Gin He Leu Lys 
315 320 325 

CTG CTC AGT TTG TGG CGA ATA AAA AAT GGC GAC CAA GAC ACC TTG AAG 1122 
Leu Leu Ser Leu Trp Arg lie Lys Asn Gly Asp Gin Asp Thr Leu Lys 
330 335 340 



ss 



118 



BNSDOCID: <EP 0784093A1J_> 



EP 0 784 093 A1 



GGC CTA ATG CAC GCA CTA AAG CAC TCA AAG ACG TAC CAC TTT CCC AAA 117 0 

Gly Leu Met His Ala Leu Lys His Ser Lys Thr Tyr His Phe Pro Lys 
345 350 355 

5 

ACT GTC ACT CAG AGT CTA AAG AAG ACC ATC AGG TTC CTT CAC AGC TTC 1218 

Thr Val Thr Gin Ser Leu Lys Lys Thr He Arg Phe Leu His Ser Phe 
360 365 370 375 

10 ACA ATG TAC AAA TTG TAT CAG AAG TTA TTT TTA GAA ATG ATA GGT AAC 12 6 6 

Thr Met Tyr Lys Leu Tyr Gin Lys Leu Phe Leu Giu Met lie Gly Asn 
380 385 390 

CAG GTC CAA TCA GTA AAA ATA AGC TGC TTA TAACTGGAAA TGGCCATTGA 1316 
75 Gin Val Gin Ser Val Lys He Ser Cys Leu 

395 400 

GCTGTTTCCT CACAATTGGC GAGATCCCAT GGATGATAA 1355 



20 



30 



35 



40 



50 



(2) INFORMATION FOR SEQ ID NO: 125: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 401 amino acids 
25 (B) TYPE: amino afcid 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:125: 

Met Asn Lys Leu Leu Cys Cys Ala Leu Val Phe Leu Asp He Ser lie 
15 10 15 

Lys Trp Thr Thr Gin Glu Thr Phe Pro Pro Lys Tyr Leu His Tyr Asp 
20 25 30 

Glu Glu Thr Ser His Gin Leu Leu Cy3 Asp Lys Cys Pro Pro Gly Thr 
35 40 45 

Tyr Leu Lys Gin His Cys Thr Ala Lys Trp Lys Thr Val Cys Ala Pro 
50 55 60 

45 Cys Pro Asp His Tyr Tyr Thr Asp Ser Trp His Thr Ser Asp Glu Cys 

65 70 75 80 

Leu Tyr Cys Ser Pro Val Cys Lys Glu Leu Gin Tyr Val Lys Gin Glu 
85 90 95 



Cys Asn Arg Thr His Asn Arg Val Cys Glu Cys Lys Glu Gly Arg Tyr 
100 105 HO 
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Leu Glu lie Glu Phe Cys Leu Lys His Arg Ser Cys Pro Pro Gly Phe 
115 120 125 

Gly Val Val Gin Ala Gly Thr Pro Glu Arg Asn Thr Val Cys Lys Arg 
130 135 140 

Cvs Pro asp Gly Phe Phe Ser Asn Glu Thr Ser Ser Lys Ala Pro Cys 
!4 5 150 ^ 155 160 

Arg Lvs His Thr Asn Cys Ser Val Phe Gly Leu Leu Leu Thr Gin Lys 
' 165 170 175 

is Glv Asn Ala Thr His Asp Asn lie Cys Ser Gly Asn Ser Glu Ser Thr 

180 ' 185 190 

Gin Lys Cys Gly He Asp Val Thr Leu Cys Glu Glu Ala Phe Phe Arg 
195 200 205 

20 

Phe Ala Val Pro Thr Lys Phe Thr Pro Asn Trp Leu Ser Val Leu Val 
210 215 220 

Asp Asn Leu Pro Gly Thr Lys Val Asn Ala Glu Ser Val Glu Arg lie 
225 230 k 235 240 

Lvs Arg Gin His Ser Ser Gin Glu Gin Thr Phe Gin Leu Leu Lys Leu 
245 250 255 



30 



3S 



Trp Lys His Gin Asn Lys Ala Gin Asp He Val Lys Lys lie lie Gin 
260 265 270 

Asp He Asp Leu Cys Glu Asn Ser Val Gin Arg His He Gly His Ala 
275 280 285 

Asn Leu Thr Phe Glu Gin Leu Arg Ser Leu Met Glu Ser Leu Pro Gly 
290 295 300 

Lys Lys Val Gly Ala Glu Asp He Glu Lys Thr He Lys Ala Cys Lys 
40 3 J 5 Y 310 315 320 

Pro Ser Asp Gin lie Leu Lys Leu Leu Ser Leu Trp Arg He Lys Asn 
325 330 335 



45 



SO 



SS 



Gly Asp Gin Asp Thr Leu Lys Gly Leu Met His Ala Leu Lys His Ser 
340 345 350 

Lys Thr Tyr His Phe Pro Lys Thr Val Thr Gin Ser Leu Lys Lys Thr 
355 360 365 

He Arg Phe Leu His Ser Phe Thr Met Tyr Lys Leu Tyr Gin Lys Leu 
370 375 380 
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Phe Leu Glu Met He Gly Asn Gin Val Gin Ser Val Lys He Ser Cys 
385 390 395 400 

Leu 



(2) INFORMATION FOR SEQ ID NO: 12 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 139 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:126: 

j 

Cys Pro Gin Gly Lys Tyr He His Pro Gin Asn Asn Ser He Cys Cys 
5 t 10 15 

Thr Lys Cys His Lys Gly Thr Tyr Leu Tyr Asn Asp Cys Pro Gly Pro 

20 ' 25 30 

Gly Gin Asp Thr Asp Cys Arg Glu Cys Glu Ser Gly Ser Phe Thr Ala 
35 40 45 

Ser Glu Asn His Leu Arg His Cys Leu Ser Cys Ser Lys Cys Arg Lys 
50 55 60 

Glu Met Gly Gin Val Glu He Ser Ser Cys Thr Val Asp Arg Asp Thr 
65 70 75 80 

Val Cys Gly Cys Arg Lys Asn Gin Tyr Arg His Tyr Trp Ser Glu Asn 
40 85 90 95 

Leu Phe Gin Cys Phe Asn Cys Ser Leu Cys Leu Asn Gly Thr Val His 
100 105 110 



Leu Ser Cys Gin Glu Lys Gin Asn Thr Val Cys Thr Cys His Ala Gly 
115 120 125 

Phe Phe Leu Arg Glu Asn Glu Cys Val Ser Cys 
130 135 



121 



BNSDOCID: <EP 0784093A1_I_> 



EP 0 784 093 A1 



(2) INFORMATION FOR SEQ ID NO:127: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:127: 
CCGGCGGACA TTTATCACAC AGCAGCTGAT GAGAAGTTTC TTCATCCA 



(2) INFORMATION FOR SEQ ID NO: 12 8: 

k 

<i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 219 amine* acids 

(B) TYPE.: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l28: 

Met Leu Gly He Trp Thr Leu Leu Pro Leu val Leu Thr Ser Val Ala 
1 5 10 15 

Arg Leu Ser Ser Lys Ser Val Asn Ala Gin Val Thr Asp He Asn Ser 
20 25 30 

Lys Gly Leu Glu Leu Arg Lys Thr Val Thr Thr Val Glu Thr Gin Asn 
35 40 45 

Leu Glu Gly Leu His His Asp Gly Gin Phe Cys His Lys Pro Cys Pro 
50 55 60 

Pro Gly Glu Arg Lys Ala Arg Asp Cys Thr Val Asn Gly Asp Glu Pro 
65 70 75 80 
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Asp Cys Val Pro Cys Gin Glu Gly Lys Glu Tyr Thr Asp Lys Ala His 
85 90 95 

Phe Ser Ser Lys Cys Arg Arg Cys Arg Leu Cys Asp Glu Gly His Gly 
100 105 110 

Leu Glu Val Glu lie Asn Cys Thr Arg Thr Gin Asn Thr Lys Cys Arg 
115 120 125 

Cys Lys Pro Asn Phe Phe Cys Asn Ser Thr Val Cys Glu His Cys Asp 
130 135 140 

Pro Cys Thr Lys Cys Glu His Gly lie He Lys Glu Cys Thr Leu Thr 
145 150 155 160 

Ser Asn Thr Lys Cys Lys Glu Glu Gly Ser Arg Ser Asn Leu Gly Trp 
165 170 175 * 

Leu Cys Leu Leu Leu Leu Pro He Pro Leu He Val Trp Val Lys Arg 

180 185 190 

j 

Lys Glu Val Gin Lys Thr Cys Arg Lys His Arg Lys Glu Asn Gin Gly 
195 V 200 205 

Ser His Glu Ser Pro Thr Leu Asn Pro Glu Thr 
210 215 

(2) INFORMATION FOR SEQ ID NO: 129: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 280 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:129: 

Met Gly Leu Ser Thr Val Pro Asp Leu Leu Leu Pro Leu Val Leu Leu 
1 5 10 15 

Glu Leu Leu Val Gly He Tyr Pro Ser Gly Val He Gly Leu Val Pro 
20 25 30 

His Leu Gly Asp Arg Glu Lys Arg Asp Ser Val Cys Pro Gin Gly Lys 
35 40 45 
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10 



20 



25 



30 



40 



45 



50 



Tyr lie His Pro Gin As 
50 



n Asn Ser He Cys Cys Thr Lys Cys His Lys 



55 60 



Gly Thr Tyr Leu Tyr Asn Asp Cys Pro Gly Pro Gly Gin Asp Thr Asp 
65 70 



75 80 



Cys Arg Glu Cys Glu Ser Gly Ser Phe Thr Ala Ser Glu Asn His Leu 
85 90 

Ara His Cys Leu Ser Cys Ser Lys Cys Arg Lys Glu Met Gly Gin Val 
« in"; no 



100 



Glu lie Ser Ser Cys Thr Val Asp Arg Asp Thr Val Cys Gly Cys Arg 
15 115 120 125 

Ly3 Asn Gin Tyr Arg His Tyr Trp Ser Glu Asn Leu Phe Gin Cys Phe 
130 135 140 

Asn Cys Ser Leu Cys Leu Asn Gly Thr Val His Leu Ser Cys Gin Glu 
14S 150 , 155 160 

Lys Gin Asn Thr Val Cys Thr Cys His Ala Gly Phe Phe Leu Arg Glu 
165 V 170 15 



Asn Glu Cys Val Ser Cys Ser Asn Cys Lys Lys Ser Leu Glu Cys Thr 
180 



185 190 



Lys Leu Cys Leu Pro Gin He Glu Asn Val Lys Gly Thr Glu Asp Ser 
195 200 205 

Glv Thr Thr Val Leu Leu Pro Leu Val He Phe Phe Gly Leu Cys Leu 
210 215 220 



Leu 



Ser Leu Leu Phe He Gly Leu Met Tyr Arg Tyr Gin Arg Trp Lys 



225 230 



235 240 



Ser Lys Leu Tyr Ser lie Val Cys Gly Lys Ser Thr Pro Glu Lys Glu 
245 250 255 

Gly Glu Leu Glu Gly Thr Thr Thr Lys Pro Leu Ala Pro Asn Pro Ser 
260 265 270 

Phe Ser Pro Thr Pro Gly Phe Thr 
275 280 



55 
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(2) INFORMATION FOR SEQ ID NO:130: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 207 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130: 

Met Leu Arg Leu lie Ala Leu Leu Val Cys Val Val Tyr Val Tyr Gly 
1 5 10 15 

20 

Asp- Asp Val Pro Tyr Ser Ser Asn Gin Gly Lys Cys Gly Gly His Asp 
20 25 30 

j 

Tyr Glu Lys Asp Gly Leu Cys Cys Ala Ser Cys His Pro Gly Phe Tyr 
25 35 40 45 

V 

Ala Ser Arg Leu Cys Gly Pro Gly Ser Asn Thr Val Cys Ser Pro Cys 
50 55 60 

30 Glu Asp Gly Thr Phe Thr Ala Ser Thr Asn His Ala Pro Ala Cys Val 

65 70 75 80 

Ser Cys Arg Gly Pro Cys Thr Gly His Leu Ser Glu Ser Gin Pro Cys 
85 90 95 

35 

Asp Arg Thr His Asp Arg Val Cys Asn Cys Ser Thr Gly Asn Tyr Cys 
100 105 110 

Leu Leu Lys Gly Gin Asn Gly Cys Arg lie Cys Ala Pro Gin Thr Lys 
40 115 120 125 

Cys Pro Ala Gly Tyr Gly Val Ser Gly His Thr Arg Ala Gly Asp Thr 
130 135 140 



45 



SO 



Leu Cys Glu Lys Cys Pro Pro His Thr Tyr Ser Asp Ser Leu Ser Pro 
145 150 155 160 

Thr Glu Arg Cys Gly Thr Ser Phe Asn Tyr lie Ser Val Gly Phe Asn 

165 170 175 

Leu Tyr Pro Val Asn Glu Thr Ser Cys Thr Thr Thr Ala Gly His Asn 

180 185 190 



55 
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Glu Val lie Lys Thr Lys Glu Phe Thr Val Thr Leu Asn Tyr Thr 
195 200 205 

(2) INFORMATION FOR SEQ ID NO: 131: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 227 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131: 

Met Ala Pro Val Ala Val Trp Ala Ala Leu Ala Val Gly Leu Glu Leu 

1-5 10 15 

j 

Trp Ala Ala Ala His Ala Leu Pro Ala Gin Val Ala Phe Thr Pro Tyr 
20 \ 25 30 

Ala Pro Glu Pro Gly Ser Thr Cys Arg Leu Arg Glu Tyr Tyr Asp Gin 
35 40 45 

Thr Ala Gin Met Cys Cys Ser Lys Cys Ser Pro Gly Gin His Ala Lys , 
50 55 60 

Val Phe Cys Thr Lys Thr Ser Asp Thr Val Cys Asp Ser Cys Glu Asp 
65 70 75 80 

Ser Thr Tyr Thr Gin Leu Trp Asn Trp Val Pro Glu Cys Leu Ser Cys 
85 90 95 

Gly Ser Arg Cys Ser Ser Asp Gin Val Glu Thr Gin Ala Cys Thr Arg 
100 105 HO 

Glu Gin Asn Arg He Cys Thr Cys Arg Pro Gly Trp Tyr Cys Ala Leu 
115 120 125 

Ser Lys Gin Glu Gly Cys Arg Leu Cys Ala Pro Leu Arg Lys Cys Arg 
130 135 140 

Pro Gly Phe Gly val Ala Arg Pro Gly Thr Glu Thr Ser Asp Val Val 
so 145 * ' 150 155 160 

Cys Lys Pro Cys Ala Pro Gly Thr Phe Ser Asn Thr Thr Ser Ser Thr 
165 170 175 

ss 



20 



25 



30 



35 



40 



45 
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Asp He Cys Arg Pro His Gin He Cys Asri Val Val Ala lie Pro Gly 
180 185 190 

Asn Ala Ser Arg Asp Ala Val Cys Thr Ser Thr Ser Pro Thr Arg Ser 
195 200 205 

Met Ala Pro Gly Ala Val His Leu Pro Gin Pro Val Ser Thr Arg Ser 
210 215 220 

Gin His Thr 
225 

(2) INFORMATION FOR SEQ ID NO: 132: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 197 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

i 

(ii) MOLECULE TYPE: protein 

k 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132: 

Met Val Ser Leu Pro Arg Leu Cys Ala Leu Trp Gly Cys Leu Leu Thr 
1 5 10 15 

Ala Val His Leu Gly Gin Cys Val Thr Cys Ser Asp Lys Gin Tyr Leu 
20 25 30 

His Asp Gly Gin Cys Cys Asp Leu Cys Gin Pro Gly Ser Arg Leu Thr 
35 40 45 

Ser His Cys Thr Ala Leu Glu Lys Thr Gin Cys His Pro Cys Asp Ser 
50 55 60 

Gly Glu Phe Ser Ala Gin Trp Asn Arg Glu He Arg Cys His Gin His 
65 70 75 80 

Arg His Cys Glu Pro Asn Gin Gly Leu Arg Val Lys Lys Glu Gly Thr 
85 90 95 

Ala Glu Ser Asp Thr Val Cys Thr Cys Lys Glu Gly Gin His Cys Thr 
100 105 HO 

Ser Lys Asp Cys Glu Ala Cys Ala Gin His Thr Pro Cys He Pro Gly 
115 120 125 
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Phe Gly Val Met Glu Met Ala Thr Glu Thr Thr Asp Thr Val Cys His 
130 ' 135 "0 

Pro Cys Pro Val Gly Phe Phe Ser Asn Gin Ser Ser Leu Phe Glu Lys 
145 150 1^5 

Cvs Tyr Pro Trp Thr Ser Cys Glu Asp Lys Asn Leu Glu Val Leu Gin 
165 I 70 175 

Lys Gly Thr Ser Gin Thr Asn Val He Cys Gly Leu Lys Ser Arg Met 
180 . 185 190 

Arg Ala Leu Leu Val 
195 

(2) INFORMATION FOR SEQ ID NO: 133: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 208 amino^acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear^ 



10 



15 



25 



30 



35 



40 



45 



50 
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<ii) MOLECULE TYPE: proteih 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133: 

Met Asn Lys Trp Leu Cys Cys Ala Leu Leu Val Phe Leu Asp lie He 
1 5 10 15 

Glu Trp Thr Thr Gin Glu Thr Phe Pro Pro Lys Tyr Leu His Tyr Asp 
20 25 30 

Pro Glu Thr Gly Arg Gin Leu Leu Cys Asp Lys Cys Ala Pro Gly Thr 
35 40 4 5 

Tyr Leu Lys Gin His Cys Thr Val Arg Arg Lys Thr Leu Cys Val Pro 
50 ' 55 60 

Cys Pro Asp Tyr Ser Tyr Thr Asp Ser Trp His Thr Ser Asp Glu Cys 
65 70 75 80 

Val Tyr Cys Ser Pro Val Cys Lys Glu Leu Gin Thr Val Lys Gin Glu 
85 90 95 
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15 



20 



25 



30 



35 



Cys Asn Arg Thr His Asn Arg Val Cys Glu Cys Glu Glu Gly Arg Tyr 
100 105 110 

Leu Glu Leu Glu Phe Cys Leu Lys His Arg Ser Cys Pro Pro Gly Leu 
115 120 125 

Gly Val Leu Gin Ala Gly Thr Pro Glu Arg Asn Thr Val Cys Lys Arg 
130 135 140 

Cys Pro Asp Gly Phe Phe Ser Gly Glu Thr Ser Ser Lys Ala Pro Cys 
145 150 155 160 

Arg Lys His Thr Asn Cys Ser Ser Leu Gly Leu Leu Leu lie Gin Lya 
1.65 170 175 

Gly Asn Ala Thr His Asp Asn Val Cys Ser Gly Asn Arg Glu Ala Thr 
180 185 . 190 

Gin Asn Cys Gly lie Asp Val Thr Leu Cys Glu Glu Ala Phe Phe Arg 
195 200 205 



(2) INFORMATION FOR SEQ ID NO: 134: 

V 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 224 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134: 

40 Met Gly Ala Gly Ala Thr Gly Arg Ala Met Asp Gly Pro Arg Leu Leu 

15 10 15 



Leu Leu Leu Leu Leu Gly Val Ser Leu Gly Gly Ala Lys Glu Ala Cys 

45 20 25 30 

Pro Thr Gly Leu Tyr Thr His Ser Gly Glu Cys Cys Lys Ala Cys Asn 
35 40 45 

50 Leu Gly Glu Gly Val Ala Gin Pro Cys Gly Ala Asn Gin Thr Val Cys 

50 55 60 



55 
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Glu Pro Cys Leu Asp Ser Val Thr Phe Ser Asp Val Val Ser Ala Thr 
65 ~ 70 75 80 

Glu Pro Cys Lys Pro Cys Thr Glu Cys Val Gly Leu Gin Ser Met Ser 
85 90 95 

Ala Pro Cys Val Glu Ala Asp Asp Ala Val Cys Arg Cys Ala Tyr Gly 
100 105 HO 

Tyr Tyr Gin Asp Glu Thr Thr Gly Arg Cys Glu Ala Cys Arg Val Cys 
115 120 125 

Glu Ala Gly Ser Gly Leu Val Phe Ser Cys Gin Asp Lys Gin Asn Thr 
130 135 140 

Val Cys Glu Glu Cys Pro Asp Gly Thr Tyr Ser Asp Glu Ala Asn His 
145 150 155 160 

20 

Val Asp Pro Cys Leu Pro Cys Thr Val Cys Glu Asp Thr Glu Arg Gin 
165 . 170 175 

Leu Arg Glu Cys Thr Arg Trp Ala Asp Ala Glu Cys Glu Glu He Pro 
180 V 185 190 

Gly Arg Trp He Thr Arg S4r Thr Pro Pro Glu Gly Ser Asp Ser Thr 
195 200 205 

30 Ala Pro Ser T hr Gin Glu Pro Glu Ala Pro Pro Glu Gin Asp Leu lie 

210 215 220 



25 



35 



45 



(2) INFORMATION FOR SEQ ID NO: 135: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 205 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 
40 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135: 

Met Tyr Val Trp Val Gin Gin Pro Thr Ala Phe Leu Leu Leu Gly Leu 
50 1 5 10 15 

Ser Leu Gly Val Thr Val Lys Leu Asn Cys Val Lys Asp Thr Tyr Pro 
20 25 30 

55 
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Ser Gly His Lys Cys Cys Arg Glu Cys Gin Pro Gly His Gly Met Val 
35 40 45 

Ser Arg Cys Asp His Thr Arg Asp Thr Val Cys His Pro Cys Glu Pro 
50 55 60 

Gly Phe Tyr Asn Glu Ala Val Asn Tyr Asp Thr Cys Lys Gin Cy3 Thr 
65 70 75 80 

Gin Cys Asn His Arg Ser Gly Ser Glu Leu Lys Gin Asn Cys Thr Pro 
85 90 95 

Thr Glu Asp Thr Val Cys Gin Cys Arg Pro Gly Thr Gin Pro Arg Gin 
100 105 110 

Asp Ser Ser His Lys Leu Gly Val Asp Cys Val Pro Cys Pro Pro Gly 
115 120 125 

His Phe Ser Pro Gly Ser Asn Gin Ala Cys Lys Pro Trp Thr Asn Cys 
130 13% 140 

Thr Leu Ser Gly Lys Gin I&e Arg His Pro Ala Ser Asn Ser Leu Asp 
145 150 ^ 155 160 

Thr Val Cys Glu Asp Arg Ser Leu Leu Ala Thr Leu Leu Trp Glu Thr 
165 170 175 

30 Gin Arg Thr Thr Phe Arg Pro. Thr Thr Val Pro Ser Thr Thr Val Trp 

. 180 185 190 

Pro Arg Thr Ser Gin Leu Pro Ser Thr Pro Thr Leu Val 
195 200 205 



70 
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(2) INFORMATION FOR SEQ ID NO: 136: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 191 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136: 

Met Gly Asn Asn Cys Tyr Asn Val Val Val lie Val Leu Leu Leu Val 
15 10 15 



131 



BNSDOCID: <EP 0784093A1 J_> 



EP 0 784 093 A1 



70 



15 



20 



30 



35 



40 



45 



SO 



Gly Cys Glu Lys Val Gly Ala Val Gin Asn Ser Cys Asp Asn Cys Gin 
20 25 30 

Pro Gly Thr Phe Cys Arg Lys Tyr Asn Pro Val Cys Lys Ser Cys Pro 
35 40 45 

Pro Ser Thr Phe Ser Ser He Gly Gly Gin Pro Asn Cys Asn He Cys 
50 55 60 

Arc Val Cys Ala Gly Tyr Phe Arg Phe Lys Lys Phe Cys Ser Ser Thr 
65 70 75 80 

His Asn Ala Glu Cys Glu Cys He Glu Gly Phe His Cys Leu Gly Pro 
85 90 95 

Gin Cys Thr Arg Cys Glu Lys Asp Cys Arg Pro Gly Gin Glu Leu Thr 
100 105 110 

Lys Gin Gly Cys Lys Thr Cys Ser Leu Gly Thr Phe Asn Asp Gin Asn 
115 ,120 125 



Gly Thr Gly Val Cys Arg Pro Trp Thr Asn Cys Ser Leu Asp Gly Arg 

130 J 1* I 40 

Ser Val Leu Lys Thr Gly Tttr Thr Glu Lys Asp Val Val Cys Gly Pro 

145 ISO 155 160 



Pro Val Val Ser Phe Ser Pro Ser Thr Thr He Ser Val Thr Pro Glu 
165 170 175 

Gly Gly Pro Gly Gly' His Ser Leu Gin Val Leu Thr Leu Phe Leu 
180 185 190 

(2) INFORMATION FOR SEQ ID NO: 137: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNE S S : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:137: 
TATGGATGAA GAAACTTCTC ATCAGCTGCT GTGTGATAAA TGTCCGCCGG GTAC 54 
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(2) INFORMATION FOR SEQ ID NO: 138: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 380 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



75 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138: 

Glu Thr Leu Pro Pro Lys Tyr Leu His Tyr Asp Pro Glu Thr Gly His 
1 5 10 15 * 

20 

Gin Leu Leu Cys Asp Lys Cys Ala Pro Gly Thr Tyr Leu Lys Gin His 
20 25 30 

j 

Cys Thr Val Arg Arg Lys Thr Leu Cys Val Pro Cys Pro Asp His Ser 
35 40 45 

25 V 

Tyr Thr Asp Ser Trp His Ttrr Ser Asp Glu.Cys Val Tyr Cys Ser Pro 
50 55 60 



Val Cys Lys Glu Leu Gin Ser Val Lys Gin Glu Cys Asn Arg Thr His 
65 70 75 80 

Asn Arg Val Cys Glu Cys Glu Glu Gly Arg Tyr Leu Glu lie Glu Phe 
85 90 95 

Cys Leu Lys His Arg Ser Cys Pro Pro Gly Ser Gly Val Val Gin Ala 
100 105 1 110 

Gly Thr Pro Glu Arg Asn Thr Val Cys Lys Lys Cys Pro Asp Gly Phe 
115 120 125 

Phe Ser Gly Glu Thr Ser Ser Lys Ala Pro Cys lie Lys His Thr Asn 
130 135 140 

45 Cys Ser Thr Phe Gly Leu Leu Leu lie Gin Lys Gly Asn Ala Thr His 

145 150 155 160 

Asp Asn Val Cys Ser Gly Asn Arg Glu Ala Thr Gin Lys Cys Gly lie 
1.65 170 175 

50 

Asp Val Thr Leu Cys Glu Glu Ala Phe Phe Arg Phe Ala Val Pro Thr 
180 185 190 

55 
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Lys lie lie Pro Asn Trp Leu Ser Val Leu Val Asp Ser Leu Pro Gly 
195 200 205 

Thr Lys Val Asn Ala Glu Ser Val Glu Arg He Lys Arg Arg His Ser 
210 215 220 

Ser Gin Glu Gin Thr Phe Gin Leu Leu Lys Leu Trp Lys His Gin Asn 
225 230 235 240 

Arg Asp Gin Glu Met Val Lys Lys He He Gin Asp He Asp Leu Cys 
245 250 255 

Glu Ser Ser Val Gin Arg His Leu Gly His Ser Asn Leu Thr Thr Glu 
15 260 265 270 

Gin Leu Leu Ala Leu Met Glu Ser Leu Pro Gly Lys Lys He Ser Pro 
275 280 285 

20 Glu Glu He Glu Arg Thr Arg Lys Thr Cys Lys Ser Ser Glu Gin Leu 

290 295 300 

Leu Lys Leu Leu Ser Leu Tip Arg He Lys Asn Gly Asp Gin Asp Thr 
305 310 . 315 320 

25 v 

Leu Lys Gly Leu Met Tyr Ala Leu Lys His Leu Lys Thr Ser His Phe 
325 330 335 

Pro Lvs Thr Val Thr His Ser Leu Arg Lys Thr Met Arg Phe Leu His 
30 340 345 350 

Ser Phe Thr Met Tyr Arg Leu Tyr Gin Lys Leu Phe Leu Glu Met He 
355 360 365 



35 



40 



Gly Asn Gin Val Gin Ser Val Lys He Ser Cys Leu 
370 375 380 

(2) INFORMATION FOR SEQ ID NO:139: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 380 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 
45 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



50 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139: 

Glu Thr Phe Pro Pro Lys Tyr Leu His Tyr Asp Glu Glu Thr Ser His 
1 5 10 15 

Gin Leu Leu Cys Asp Lys Cys Pro Pro Gly Thr Tyr Leu Lys Gin His 
20 25 30 

Cys Thr Ala Lys Trp Lys Thr Val Cys Ala Pro Cys Pro Asp His Tyr 
35 40 45 

Tyr Thr Asp Ser Trp His Thr Ser Asp Glu Cys Leu Tyr Cys Ser Pro 
50 55 ' ~ 60 

Val Cys Lys Glu Leu Gin Tyr Val Lys Gin Glu Cys Asn Arg Thr His 
65 70 75 80 

Asn Arg Val Cys Glu Cys Lys Glu Gly Arg Tyr Leu Glu lie Glu Phe 
85 90 95 

j 

Cys Leu Lys His Arg Ser Cys Pro Pro Gly Phe Gly Val Val Gin Ala 
100 105 110 

k. 

Gly Thr Pro Glu Arg Asn. Thr Val Cys Lys Arg Cys Pro Asp Gly Phe 
115 * 120 125 

Phe Ser Asn Glu Thr Ser Ser Lys Ala Pro Cys Arg Lys His Thr Asn 
130 135 140 

Cys Ser Val Phe Gly Leu Leu Leu Thr Gin Lys Gly Asn Ala Thr His 
145 150 155 160 

Asp Asn lie Cys Ser Gly Asn Ser Glu Ser Thr Gin Lys Cys Gly lie 
165 170 175 

Asp Val Thr Leu Cys Glu Glu Ala Phe Phe Arg Phe Ala Val Pro Thr 
180 185 190 

Lys Phe Thr Pro Asn Trp Leu Ser Val Leu Val Asp Asn Leu Pro Gly 
195 200 205 

Thr Lys Val Asn Ala Glu Ser Val Glu Arg lie Lys Arg Gin His Ser 
210 215 220 

Ser Gin Glu Gin Thr Phe Gin Leu Leu Lys Leu Trp Lys His Gin Asn 
225 230 235 240 

Lys Ala Gin Asp lie Val Lys Lys lie lie Gin Asp lie Asp Leu Cys 
245 ~* ~ 250 255 



07S4093A1J_> 



135 



EP 0 784 093 A1 



Glu Asn Ser Val Gin Arg His He Gly His Ala Asn Leu Thr Phe Glu 
260 265 270 

Gin Leu Arg Ser Leu Met Glu Ser Leu Pro Gly Lys Lys Val Gly Ala 
275 280 . 285 

Glu Asp He Glu Lys Thr He Lys Ala Cys Lys Pro Ser Asp Gin He 
290 295 300 

Leu Lys Leu Leu Ser Leu Trp Arg He Lys Asn Gly Asp Gin Asp Thr 
305 310 315 320 

Leu Lys Gly Leu Met His Ala Leu Lys His Ser Lys Thr Tyr His Phe 
325 330 335 

Pro Lys Thr Val Thr Gin Ser Leu Lys Lys Thr He Arg Phe Leu His 
340 345 350 

Ser Phe Thr Met Tyr Lys Leu Tyr Gin Lys Leu Phe Leu Glu Met He 
355 360 365 

Gly Asn Gin Val Gin Ser Val Lys He Ser Cys Leu 
370 375 380 

V 

(2) INFORMATION FOR SEQ ID NO: 140: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140: 
TGGACCACCC AGAAGTACCT TCATTATGAC 
(2) INFORMATION FOR SEQ ID NO: 141: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141 
GTCATAATGA AGGTACTTCT GGGTGGTCCA 
(2) INFORMATION FOR SEQ ID NO: 142: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



<xi) SEQUENCE DESCRIPTION^ SEQ ID NO: 142 
GGACCACCCA GCTTCATTAT GACGAAGAAA C 
(2) INFORMATION FOR SEQ ID NO: 143: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143 
GTTTCTTCGT CATAATGAAG CTGGGTGGTC C 
(2) INFORMATION FOR SEQ ID NO: 144: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144: 
GTGGACCACC CAGGACGAAG AAACCTCTC 
(2) INFORMATION FOR SEQ ID NO: 145: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 145: 
GAGAGGTTTC TTCGTCCTGG GTGGTCCAC 
(2) INFORMATION FOR SEQ ID NO: 14 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

( i i > MOLECULE TYPE : cDNA 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146 
CGTTTCCTCC AAAGTTCCTT CATTATGAC 
(2) INFORMATION FOR SEQ ID NO:147: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147 
GTCATAATGA AGGAACTTTG GAGGAAACG 
(2) INFORMATION FOR SEQ ID NO: 148: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148 
GGAAACGTTT CCTGCAAAGT ACCTTCATTA TG 
(2) INFORMATION FOR SEQ ID NO: 14 9: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 

CATAATGAAG GTACTTTGCA GGAAACGTTT CC 

(2) INFORMATION FOR SEQ ID NO: 150: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 27 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150: 

CACGCAAAAG TCGGGAATAG ATGTCAC 

(2) INFORMATION FOR SEQ ID NO: 151: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 27 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:151: 
GTGACATCTA TTCCCGACTT TTGCGTG 
(2) INFORMATION FOR SEQ ID NO: Jf52 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152 
CACCCTGTCG GAAGAGGCCT TCTTC 
(2) INFORMATION FOR SEQ ID NO: 153: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 153 
GAAGAAGGCC TCTTCCGACA GGGTG 
(2) INFORMATION FOR SEQ ID NO: 154: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154 

k 

TGACCTCTCG GAAAGCAGCG TGCA 
(2) INFORMATION FOR SEQ ID NO: 155: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155 
TGCACGCTGC TTTCCGAGAG GTCA 
(2) INFORMATION FOR SEQ ID NO: 156: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156: 
CCTCGAAATC GAGCGAGCAG CTCC 
(2) INFORMATION FOR SEQ ID NO: 157: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SfeQ ID NO: 157: 
CGATTTCGAG GTCTTTCTCG TTCTC ^ 
(2) INFORMATION FOR SEQ ID NO:158: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158 
CCGTGAAAAT AAGCTCGTTA TAACTAGGAA TGG 
(2) INFORMATION FOR SEQ ID NO: 159: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 159: 
CCATTCCTAG TTATAACGAG CTTATTTTCA CGG 
(2) INFORMATION FOR SEQ ID NO: 160: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160: 

j 

CCTCTGAGCT CAAGCTTCCG AGGACCACAA TGAACAAG 
(2) INFORMATION FOR SEQ ID NO:fc.61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D ) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161: 
CCTCTCTCGA GTCAGGTGAC ATCTATTCCA CACTTTTGCG TGGC 
(2) INFORMATION FOR SEQ ID NO: 162: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 162: 
CCTCTGAGCT CAAGCTTCCG AGGACCACAA TGAACAAG 
(2) INFORMATION FOR SEQ ID NO: 163: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



<xi) SEQUENCE DESCRIPTION : . SEQ ID NO: 163: 

V. 

CCTCTCTCGA GTCAAGGAAC AGCAAACCTG AAGAAGGC 

V 

(2) INFORMATION FOR SEQ ID NO: 164: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164 
CCTCTGAGCT CAAGCTTCCG AGGACCACAA TGAACAAG 
(2) INFORMATION FOR SEQ ID NO: 165: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 165: 

5 

CCTCTCTCGA GTCACTCTGT GGTGAGGTTC GAGTGGCC 38 
(2) INFORMATION FOR SEQ ID NO: 166: 

10 (i) SEQUENCE CHARACTERISTICS :' 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 
<C> STRANDEDNESS : single 
(D) TOPOLOGY: linear 

15 

(ii) MOLECULE TYPE: cDNA 



20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 166: 

j 

CCTCTGAGCT CAAGCTTCCG AGGACCACAA TGAACAAG 38 
(2) INFORMATION FOR SEQ ID NO:^167: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 167: 
CCTCTCTCGA GTCAGGATGT TTTCAAGTGC TTGAGGGC 38 
(2) INFORMATION FOR SEQ ID NO: 168: 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acid3 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 168: 

Met Lys His His His His His His His Ala Ser Val Asn Ala Leu Glu 
1 5 10 15 



Claims 



10 



25 



1 . An isolated nucleic acid encoding a polypeptide comprising at least one of the biological activities of OPG wherein 
the nucleic acid is selected from the group consisting of: 

a) the nucleic acids shown in Figures 2B-2C (SEQ ID NO: 120), 9A-9B (SEQ ID NO: 122), and 9C-9D (SEQ 
15 ID NO: 124) or complementary strands thereof; 

b) nucleic acids which hybridize under stringent conditions with the polypeptide-encoding regions as shown 
in Figures 2B-2C (SEQ ID NO: 120), 9A-9B (SEQ ID NO: 122) and 9C-9D (SEQ ID NO: 124); 

c) nucleic acids which hybridize under stringent conditions with nucleotides 1 48 through 337 inclusive as shown 
in Figure 1 A; and 

20 d) nucleic acid which are degenerate to the nucleic acids of (a), (b) and (c). 

2. The nucleic acid of Claim 1 which is cDNA, genomic DNA, synthetic DNA or RNA. 

3. A polypeptide encoded by the nucleic acid of Claim 1 . 

4. The nucleic acid of Claim 1 including one or more codons preferred for Escherichia coli expression. 

5. The nucleic acid of Claim 1 having a detectable label attached thereto. 

30 6. The nucleic acid of Claim 1 comprising the polypeptide-encoding region of Figure 2B-2C (SEQ ID NO: 120), Figure 
9A-9B (SEQ ID NO: 122) or Figure 9C-9D (SEQ ID NO: 124). 

7. The nucleic acid of Claim 6 haying the sequence as shown in Figure 9B from nucleotides 158-1297. 
35 8. An expression vector comprising the nucleic acid of Claim 1 . 

9. The expression vector of Claim 8 wherein the nucleic acid comprises the polypeptide - encoding region as shown 
in Figure 9C-9D (SEQ ID NO: 124). 

40 1 0. A host cell transformed or transfected with the expression vector of Claim 8. 

11. The host cell of Claim 10 which is a eucaryotic cell. 

12. The host cell of Claim 1 1 which is selected from the group consisting of CHO, COS, 293, 3T3, C V-1 and BHK cells. 

1 3. The host cell of Claim 1 0 which is a procaryotic cell. 

14. The host cell of Claim 13 which is Escherichia coli. 

so 15. A transgenic mammal comprising the expression vector of Claim 8. 

1 6. The transgenic mammal of Claim 1 5 which is a rodent. 

17. The transgenic mammal of Claim 16 which is a mouse. 

18. A process for the production of OPG comprising: 
growing under suitable nutrient conditions host cells transformed or transfected with the nucleic acid of Claim 
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1 ; and 

isolating the polypeptide products of the expression of the nucleic acids. 

19. A purifed and isolated polypeptide comprising OPG. 

5 

20. The polypeptide of Claim 19 which is mammalian OPG. 

21. The polypeptide of Claim 20 which is human OPG. 

io 22. The polypeptide of Claim 1 9 which is substantially free of other human proteins. 

23. The polypeptide of Claim 21 having the amino acid sequence as shown in Figure 2B-2C (SEQ ID NO: 1 21 ), Figure 
9A-9B (SEQ ID NO: 123), or Figure 9C-9D (SEQ ID NO: 125) or a derivative thereof. 

is 24. The polypeptide of Claim 23 having the amino acid sequence as shown in Figure 9C-9D (SEQ ID NO:125) from 
residues 22-401 inclusive. 

25. The polypeptide of Claim 23 having the amino acid sequence as shown in Figure 9C-9D (SEQ ID NO: 125) from 
residues 32-401 inclusive. 

20 

26. The polypeptide of Claim 19 which is characterized by being a product of expression of an exogenous DNA se- 
quence. 

27. The polypeptide of Claim 26 wherein the DNA is cDNA, genomic DNA or synthetic DNA. 

25 

28. The polypeptide of Claim 1 9 which has been modified with a water-soluble polymer. 

29. The polypeptide of Claim 28 wherein the water soluble polymer is polyethylene glycol. 

30 30. A polypeptide comprising: 

an amino acid sequence of at least about 1 64 amino acids comprising four cysteine-rich domains characteristic 
of the cysteine rich domains of tumor necrosis factor receptor extracellular regions; and 
an activity of increasing bone density. 

35 

31. A polypeptide comprising the amino acid sequence as shown in Figure 2B-2C (SEQ ID NO: 121), Figure 9A-9B 
(SEQ ID NO: 123) or Figure 9C-9D (SEQ ID NO: 125) having an amino terminus at residue 22, and wherein from 
1 to 216 amino acids are deleted from the carboxy terminus. 

40 32. The polypeptide of Claim 31 comprising the amino acid sequence from residues 22-1 85, 22-1 89, 22-1 94, or 22-201 
inclusive. 

33. The polypeptide of Claim 32 further comprising an Fc region of human IgG 1 extending from the carboxy terminus. 

45 34. A polypeptide comprising the amino acid sequence as shown in Figure 2B-2C (SEQ ID NO: 121), Figure 9A-9B 
(SEQ ID NO: 123) or Figure 9C-9D (SEQ ID NO: 125) having an amino terminus at residue 22, wherein from 1 to 
10 amino acids are deleted from the amino terminus and, optionally from 1 to 216 amino acids are deleted from 
the carboxy terminus. 

so 35. The polypeptide of Claim 34 comprising the amino acid sequence from residues 27-1 85, 27-1 89, 27-1 94, 27-401 , 
or 32-401 inclusive. 

36. The polypeptide of Claim 35 further comprising an Fc region of human lgG1 extending from the carboxy terminus. 

55 37. a polypeptide selected from the group consisting of: 

huOPG [22-201 ]-Fc 
huOPG [22-401 ]-Fc 
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huOPG [22-180]-Fc 

huOPG met [22-401 ]-Fc 

huOPG Fc-met [22-401] 

huOPG met [22-185] 
s huOPG met [22-189] 

huOPG met [22-194] 

huOPG met [27-185] 

huOPG met [27-189] 

huOPG met [27-194] 
10 huOPG met [32-401] 

huOPG met-lys[22-401] 

huOPG met [22-401] 

huOPG met [22-401 ]-Fc (P25A) 

huOPG met [22-401] (P25A) 
15 huOPG met [22-401] (P26A) 

huOPG met [22-401] (P26D) 

huOPG met [22-194] (P25A) 

huOPG met [22-1 94] (P26A) 

huOPG met met-(lys)3 [22-401] 
20 huOPG met met-arg-gly-ser-(his)6 [22-401] 

38. A nucleic acid encoding the polypeptide of Claim 37. 

39. An antibody or fragment thereof which specifically binds to OPG. 

40. The antibody of Claim 39 which is a monoclonal antibody. 

41. A method for detecting the presence of OPG in a biological sample comprising: 
incubating the sample with the antibody of Claim 39 under conditions that allow binding of the antibody to 
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OPG; and 

detecting the bound antibody. 

42. A method to assess the ability of a candidate substance to bind to OPG comprising: 

incubating OPG with the candidate substance under conditions that allow binding; and 
measuring the bound substance. 

43. A method of regulating the levels of OPG in an animal comprising modifying the animal with a nucleic acid encoding 
40 OPG. 

44. The method of Claim 43 wherein the nucleic acid promotes an increase in the tissue level of OPG. 

45. The method of Claim 44 wherein the animal is a human. 

46. A pharmaceutical composition comprising a therapeutically effective amount of OPG in a pharmaceutical^ accept- 
able carrier, adjuvant, solubilizer, stabilizer and/or antioxidant. 

47. The composition of Claim 46 wherein the OPG is human OPG. 

48. The composition of Claim 47 wherein the OPG has the amino acid sequence as shown in Figure 9B. 

49. A method of treating a bone disorder comprising administering a therapeutically effective amount of the polypeptide 
of Claim 19. 

50. The method of Claim 49 wherein the polypeptide is human OPG. 

51. The method of Claim 49 wherein the bone disorder is excessive bone loss. 
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52. The method of Claim 51 wherein the bone disorder is selected from the group consisting of osteoporosis, Paget's 
disease of bone, hypercalcemia, hyperparathyroidism, steroid-induced osteopenia, bone loss due to rheumatoid 
arthritis, bone loss due to osteomyelitis, osteolytic metastasis, and periodontal bone loss. 

$ 53. The method of Claim 49 further comprising administering a therapeutically effective amount of a substances se- 
lected from the group consisting of bone morphogenic proteins BMP-1 through BMP-12, TGF-0 family members, 
IL-1 inhibitors, TNFa inhibitors, parathyroid hormone and analogs thereof, parathyroid hormone related protein 
and analogs thereof, E series prostaglandins, bisphosphonates, and bone-enhancing minerals. 

10 54. An osteoprotegerin multimer consisting of osteoprotegerin monomers. 

55. The multimer of Claim 54 which is a dimer. 

56. The multimer of Claim 54 formed by interchain disulfide bonds. 

15 

57. The multimer of Claim 54 formed by association Fc regions derived from human lgG1 . 

58. The multimer of Claim 54 which is essentially free of osteoprotegerin monomers and inactive multimers. 

20 59. The multimer of Claim 54 wherein the monomers comprise the amino acid sequence as shown in Figure 9C-9D 
(SEQ ID NO: 125) from residues 22-401, or a derivative thereof. 

60. The multimer of Claim 54 wherein the monomers comprise the amino acid sequence shown in Figure 9C-9D (SEQ 
ID NO: 1 25) from residues 22-1 94. 
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FIG.2A 

AUG TAG 

FIG.2B 

in 30 50 

ATCAAAGGCAGGGCATACTTCCTGTTGCCCAGACCTTATATAAAACGTCATGTTCGCCTG 

ggcagcagIgaagcacctagcactggcccagcggctgccgcctgaggtttccagaggacc 

130 150 1/u 

acaatgaacaagtggctgtcctgtgcactcctggtgttcttggacatca™ 

HNffWTrCrft T, T. V E L P T T B W T 

190 210 

acccaggaaacctttcctccaaaatacttgcattatgacccagaaaccggacgtcagctc 
toe tfppkylhydpet^g^rql 

TTGTGTGACAAATGTGCTCCTGGCACCTACCTAAAACAG^ 
LCDKCAPGTYLKQHCT V 

310 33° 3 

ACACTGTGTGTCCCTTGCCCTGACTACTCTTATACAGACAGC 

TLCVPCPDYSYTDSWHTSDH 

370 390 410 

TCCGTCTACTCCAGCCCCGTC^ 

C V Y C S P V C K E L Q T V K Q B C H. 

d30 450 4VO 

ACCCACAACCGAGTGTGCGAATGTGAGGAAGGGCGCTACCTGGAGCT 

T H N R V C E C E E^G R Y L 530 

aagcaccgSagctgtcccccac^cttgggtgtgctgcaggc^ 

K „ R S C P P G L G 7q V L Q A G T 5 P q E R N 
T V C K R C P D G F F S G E T S S K 



610 630 650 

VCACCAACTGC 
T H C 

ACACATCACAATGTATGTTCCGGAAACAGAGAAGCAACTCA 



TGTAGGAAACACACCAACTGCA^ 

670 690° 710 



C R K H T H C S 



T H D N V C S G N R E A T Q N C G I 
730 750 



ACCCTGTGCGAAGAGGCATTCTTCAGGTTTGCTGTGCCTACCAAGA 
C TG AG TG TT CTGGTG G ACAGTTTGCCTGGGACC AAAGTG AATGCAG AGA^TGTAGAG AGG 



'C"» E A F F R F A V P T K I I P N W 
790 810 _.! 3 ?. 



790 alu 

_^^^^^m^/^»nxr?'T««twTv-ir , r ,, nr:rif5Ar:c 

KVNA ESVER 



SVLVDSLPGT 



ATAAAACGGAGACACAGCTCGCAAGAGCAAACTTTCCAGCT 
IKRRHSSQE Q T F Q L L K W K 

CAAAACAGAGACCAGGAAATGGTGAAGJ^GATCATCCAA 

Q N Rq D Q E M V K K I U 10 10 

AGTGTGCAACGGCATATCGGCCACGCGAACCTCACCACAGAGC 

SVQRHIGHAHIjA 1 w 
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FIG.2C 



1030 1050 1070 

GAGAGCTTGCCTGGGAAGAAGATCAGCCCAGACGAGATTGAGAGAACGAGAAAGACCTGC 
ESLPGKKI SPDE I ERTRKTC 

1090 1110 1130 

AAACCCAGCGAGCAGCTCCTGAAGCTACTGAGCTTGTGGAGGATCAAAAATGGAGACCAA 
KPSEQLLKLLSLWR IKNGDQ 

1150 1170 1190 

GACACCTTGAAGGGCCTGATGTACGCACTCAAGCACTTGAAAGCATACCACTTTCCCAAA 
DT LKGLMYALKHLiK AYHF P K 

1210 1230 1250 

ACCGTCACCCACAGTCTGAGGAAGACCATCAGGTTCTTGCACAGCTTCACCATGTACCGA 
TV THSLRKTIRFLH SFTMYR 

1270 1290 1310 

TTGTATCAGAAACTCTTTCTAGAAATGATAGGGAATCAGGTTCAATCAGTGAAGATAAGC 
LYQKLFLEMIGNQVQSVK I S 

1330 1350 1370 

TGCTTATAGTTAGGAATGGTCACTGGGCTGTTTCTTCAGGATGGGCCAACACTGATGGAG 
C L 

1390 1410 1430 

CAGATGGCTGCTTCTCCGGCTCTTGAAATGGCAGTTGATTCCTTTCTCATCAGTTGGTGG 

1450 1470 1490 

GAATGAAGATCCTCCAGCCCAACACACACACTGGGGAGTCTGAGTGAGGAGAGTGAGGCA 

1510 1530 1550 

GGCTATTTGATAATTGTGCAAAGCTGCCAGGTGTACACCTAGAAAGTCAAGCAGCCTGAG 

1570 1590 1610 

AAAGAGGATATTTTTATAACCTCAAACATAGGCCCTTTCCTTCCTCTCCTTATGGATGAG 

1630 1650 1670 

TACTCAGAAGGCTTCTACTATCTTCTGTGTCATCCCTAGATGAAGGCCTCTTTTATTTAT 

1690 1710 1730 

TTTTTTATTCTTTTTTTCGGAGCTGGGGACCGAACCCAGGGCCTTGCGCTTGCGAGGCAA 

1750 1770 " 1790 

GTGCTCTACCACTGAGCTAAATCTCCAACCCCTGAAGGCCTCTTTCTTTCTGCCTCTGAT 

1810 1830 1850 

AGTCTATGACATTCTTTTTTCTACAATTCGTATCAGGTGCACGAGCCTTATCCCATTTGT 

1870 1890 1910 

AGGTTTCTAGGCAAGTTGACCGTTAGCTATTTTTCCCTCTGAAGATTTGATTCGAGTTGC 

1930 1950 1970 

AGACTTGGCTAGACAAGCAGGGGTAGGTTATGGTAGTTTATTTAACAGACTGCCACCAGG 

1990 2010 2030 

AGTCCAGTGTTTCTTGTTCCTCTGTAGTTGTACCTAAGCTGACTCCAAGTACATTTAGTA 

2050 2070 2090 

TGAAAAATAATCAACAAATTTTATTCCTTCTATCAACATTGGCTAGCTTTGTTTCAGGGC 

2110 2130 2150 

ACTAAAAGAAACTACTATATGGAGAAAGAATTGATATTGCCCCCAACGTTCAACAACCCA 

2170 2190 2210 

ATAGTTTATCCAGCTGTCATGCCTGGTTCAGTGTCTACTGACTATGCGCCCTCTTATTAC 

2230 2250 2270 

TGCATGCAGTAATTCAACTGGAAATAGTAATAATAATAATAGAAATAAAATCTAGACTCC 

2290 2310 2330 

ATTGGATCTCTCTGAATATGGGAATATCTAACTTAAGAAGCTTTGAGATTTCAGTTGTGT 

2350 2370 2390 

TAAAGGCTTTTATTAAAAAGCTGATGCTCTTCTGTAAAAGTTACTAATATATCTGTAAGA 

2410 2430 
CTATTACAGTATTGCTATTTATATCCATCCAG 
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FIG.6A 
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FIG.7A FIG.7B 
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FIG.9A 



10 30 50 

CCTTATATAARACGTCATGATTGCCTGGGCTGCAGAGACGCACCTAGCACTGACCCAGCG 

70 90 110 

GCTGCCTCCTGAGGTTTCCCGAGGACCACAATGAACAAGTGGCTGTGCTGCGCACTCCTG 

M N K W L C C & L L 

130 150 170 

GTGCTCCTGGACATCAIT'GAATGGACAACCCAGGAAACCCTTCCTCCAAAGTACTTGCAT 

V L lu n T T K W T T O P. T L P P K Y L H 

190 210 230 

TATGACCCAGAAACTGGTCATCAGCTCCTGTGTCACAAATGTGCTCCTGGCACCTACCTA 

V D P E T G H Q L L C D K C A P G T Y L 

250 270 290 

AAACAGCACTGCACAGTGAGGAGGAAGACATTGTGTGTCCCTTGCCCTGACCACTCTTAT 
K Q H C T V R R K T L C V P C P D H S Y 

310 330 350 

ACGGACAGCTGGCACACCAGTGATGAGTGTGTGTATTGCAGCCCAGTGTGCAAGGAACTG 
T D S W H T S D E C V Y C S P V C K E L 

370 390 410 

CAGTCCGTGAAGCAGGAGTGCAACCGCACCCACAACCGAGTGTGTGAGTGTGAGGAAGGG 
Q S V K Q E C H R T H N R V C E C E E G 

430 450 470 

CGTTACCTGGAGATCGAATTCTGCTTGAAGCACCGGAGGTGTCCCCCGGGCTCCGGCGTG 
R Y L E. I E F C L K H R S C P P G S G V 

490 510 530 

GTGCAAGCTGGAACCCCAGAGCGAAACACAGTTTGCAAAAAATGTCCAGATGGGTTCTTC 

V Q A G T PER N T V C K K C P D G F F 

550 570 590 

tcaggtgagacttcatcgaaaggaccctgtataAaacacacgaactgcagcacatttggc 
s g e t s s k a p c i k h t m c s t f g 

610 630 650 

CTCCTGCTAATTCAGAAAGGAAATGCAACACATGACAACGTGTGTTCCGGAAACAGAGAA 
LLLIQKGHATHDNVC SGN R E 

670 690 710 

GCCACGCAAAAGTGTGGAATAGATGTCACCCTGTGTGAAGAGGCCTTCTTCAGGTTTGCT 

A TQ K C G I D VT LC E E AF F R F A 

730 750 770 

GTTGCTACCAAGATTATACCAAATTGGCTGAGTGTTTTGGTGGACAGTTTGCCTGGGACC 
VPTKI IPNWLSVLVDSLPGT 
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FIG.9B 



M E S L P G K K I S 
1050 1070 



790 810 830 

AAAGTGAATGCCGAGAGTGTAGAGAGGATAAAACGGAGACACAGCTCACAAGAGCAAACC 

K VNAE S V E R I K R RH S S Q EQT 

850 870 890 

TTCCAGCTGCTGAAGCTGTGGAAACATCAAAACAGAGACCAGGAAATGGTGAAGAAGATC 

FOLLKLWKHQNRDQEMVKKI 

910 930 950 

ATCCAAGACATTGACCTCTGTGAAAGCAGCGTGCAGCGGCATCTCGGCCACTCGAACCTC 

I 0 DID L CESS V Q R H Ii O H S H L 
970 990 ^ _ ,_10i£ 

ACCACA 
T T E 0 
1030 

GAGATTGAGAC>«-« s^w* ««• m — — - ----- 

EIE RTRKTCKSSEQ L L K_ L L S 

1090 1110 1130 

T^ATGGAGGATCAAAAATGGTCACCAAGACACCTTGAAGGGCCTGATGTATGCCCTCAAG 

L" W R I K N G D Q D T L K G L M Y A L K 

1150 1170 1190 

CAGTTGAAAACATCCCACTT'rCCCAAAACTGTCACCCACAGTCTGAGGAAGACCATGAGG 

H L K T S H F P K T V T H S L R R_ R 

1210 1230 1 250 

TTCCTGCACAGCTTCACAATGTACAGACTGTATCAGAAGCTCTTTTTAGAAATGATAGGG 

F li H S F T M Y R L Y Q K h F I» E M I G 

1270 1290 1310 

AATCAGGTTCAATCCGTGAAAATAAGCTGCTTATAACTAGGAATGGTCACTGGGCTGTTT 

N Q V Q S V K I S C L 
CTTCA 
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FIG.9C 



10 30 50 

GTATATATAACGTGATGAGCGTACGGGTGCGGAGACGCACCGGAGCGCTCGCCCAGCCGC 

70 90 110 

CGYCTCCAAGCCCCTGAGGTTTCCGGGGACCACAATGAACAAGTTGCTGTGCTGCGCGCT 

M N K L L C C & L 

130 150 170 

CGTGTTTCTGGACATCTCCATTAAGTGGACCACCCAGGAAACGTl'TCCTCCAAAGTACCT 
VP T, D T S I K WTTO E T F P P K Y L 
190 210 230 

TCATTATGACGAAGAAACCTCTCATCAGCTGTTGTGTGACAAATGTCCTCCTGGTACCTA 
HY D E E T S H QLLC D KCPPG TY 
250 270 290 

CCTAAAACAACACTGTACAGCAAAGTGGAAGACCGTGTGCGCCCCTTGCCCTGAGCACTA 
LKQHCT AKWK TVCAPCPDHY 

310 330 ■< 350 ■ -r-y^ • 

CTACACAG AC AGCTGGC ACACCAGTGACGAGTGTCTATACTGCAGCCCCGTGTGGAAGGA 

Y T D S W H T S D E C L Y C S P V C K E 

370 ; 390. 410 

GCTGCAGTACGTCAAGCAGGAGTGCAATCGCACCCACAACCGCGTGTGCGAATGGAAGGA 
LQYVKQ E C H R T H N RVCE C K E 
430 450 470 

AGGGCGCTACCTTGAGATAGAGTTCTGCTTGAAACATAGGAGGTGCCCTCCTGGATTTGG 
G R Y L E I E F C L K H R S C P P G F G 
'490 510 ^ 530 ■ 

AGTGGTGCAAGCTGGAACCCCAGAGCGAAATACAGTTTGCAAAAGATGTCCAGATGGGTT 

V V Q A G T P E R N T V C . K ... R .. C P D G, F 

550 570 590 

CTTCTCAAATGAGACGTCATCTAAAGCACCCTGTAGAAAACACACAAATTGCAGTGTCTT 
F S M E T S S K ... A P C R K H T M C S V F 

610 630 650 

TGGTCTCCTGCTAACTCAGAAAGGAAATGCAACACACGACAACATATGTTCCGGAAACAG 

gllltqkgmath d n I c s g U . s 

670 690 710 

TGAATCAACTCAAAAATGTGGAATAGATGTTACCCTGTGTGAGGAGGCATTCTTCAGGTT 
E STQKCG IDVTLCEEAFFRF 
730 750 770 

TGCTGTTCCTACAAAGTTTACGCCTAACTGGCTTAGTGTCTTGGTAGACAATTTGCCTGG 
AVPTKFTPNWLSVLVDNLPG 
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FIG.9D 



790 810 830 

CACCAAAGTAAACGCAGAGAGTGTAGAGAGGATAAAACGGCAACACAGCTCACAAGAACA 

T K V N A E S V E R I K R Q H S S Q E Q 

850 870 
GACTTTCCAGCTGCTGAAGTTATGGAAACATCAAAACAAAGACCAAGATATAG^TCAAGAA 

T F Q L L> K L W K H Q N K D Q D - I V K K 
910 930 950 

GATCATCCAAGATATTGACCTCTGTGAAAACAGCGTGCAGCGGCACATTgGACATGCTAA 

I I Q D I D L C E N S V Q R H I.;G H A H 
1 970 990 1010 



L T F E Q L R S L M E S L P G K K V G A 
1030 1050 1070 

AGAAGACATTGAAAAAACAATAAAGGCATGCAAAGCCAGTGACCAGATCCTGAAGCTGOT 

E DIE K T I K A C K P S D Q I._L K L L 
1090 1110 1130 

CAGTTTGTGGCGAATAAAAAATGGCGACCAAGACACCTTGAAGGGCCTAATGCACGCACT 

S I* W R I K N G D Q D T L K G V-ion ' 
1150 1170 1190 

AAAGCACTCAAAGACGTACCACTTTCCCAAAACTCTCACTCAGAGTCTAAAGAAGACCAT 

K H S K T Y H F P K T V T Q S V .K K T - I 
1210 1230 125>0 

cAggttccttcacagcttcacaatgtacaaattgtatcagaagttatttttagaaatga 

RFLHS FTMYKLYQK L F^L.E ,M...,I.. 
1270 1290 1310 

aggtaaccaggtccaatcagtaaaaataagctgcttataactggaaatggccattgagct 

G N Q V Q S V K I S C L 
1330 1350 

gtttcctcacaattggcgagatcccatggatgataa 
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